THE INVERSE CONJECTURE FOR THE COWERS NORM OVER 
FINITE FIELDS IN LOW CHARACTERISTIC 
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Abstract. We establish the inverse conjecture for the Gowers norm over finite fields, 
which asserts (roughly speaking) that if a bounded function / : y C on a finite- 
dimensional vector space V over a finite field F has large Gowers uniformity norm 
ll/ll(7=+i(v)i then there exists a (non-classical) polynomial P : — T of degree at 
most s such that / correlates with the phase e{P) — e^'^*^. This conjecture had 
already been established in the "high characteristic case" , when the characteristic of 
F is at least as large as s. Our proof relies on the weak form of the inverse conjecture 
established earlier by the authors and Bergelson 'J', together with new results on the 
structure and equidistribution of non-classical polynomials, in the spirit of the work 
of Green and the first author H^l and of Kaufman and Lovett [2H1 ■ 



1. Introduction 

1.1. The inverse conjecture. Let F = Fp be a finite field of prime order ciiar(F) = p. 
Throughout this paper, F will be considered fixed (e.g. F = F2 or F = F3), and the 
term "vector space" will be shorthand for "vector space over F", and more generally 
any linear algebra term (e.g. span, independence, basis, subspace, linear transformation, 
etc.) will be understood to be over the field F unless otherwise stated. 

If is a vector space, / : — C is a function, and h & V is a shift, we define the 
multiplicative derivative Ahf : — )■ C of / by the formula 

A,/ := (T,/)7 

where the shift operator Th with shift h is defined by Thf{x) := f{x + h). If y is finite, 
and d ^ 1 is an integer, we define the Gowers uniformity norm ||/||;7d(y) by the formula 

||/||^.(V-) := |E,„...,,„„eyA,,...A,J(r^)|l/2^ 

where we use the expectation notation Eagyi/(a) := |^X]aGA/('^) finite non- 

empty set A, with l^l denoting the cardinality of A. We review some basic properties 
of the Gowers uniformity norms in Appendix [Bl 

The inverse conjecture for the Gowers norm in finite characteristic addresses the ques- 
tion of determining those bounded functions / : — i- C with large Gowers norm. To 
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phrase this conjecture correctly in the low characteristic case, we need the notion of a 
non-classical polynomiaQ[. 

Definition 1.2 (Polynomials). Let be a finite-dimensional vector space, let c? ^ be 
an integer, and let G be an additive group. A function P : V ^ G from to G is said 
to be a (non- classical) polynomial of degree ^ d ii one has 

for all hi,..., hd+i, n eV, where Ahf{n) := (Th - l)f{n) = f{n + h) - f{n) is the 
additive derivative of / in the direction h. We adopt the convention that the zero 
polynomial has degree —oo. We denote the space of all polynomials of degree ^ d 
as Poly^^(V — )■ G); this is clearly an additive group, with Poly5;^(V^ — )■ G) = {0} for 
c/ < 0. " 

Remark 1.3. In practice the group G will usually be the unit circle T := R/Z, the 
finite field F, or the embedded copy ^(F) of F in T, where 6 : F — )■ T is the additive 
homomorphism 

dj) := - mod 1. 

P 

In particular, t(F) = ^Z/Z is the group of p*^ roots of unity in M/Z. 

Remark 1.4. Polynomials that take values in F (and by abuse of notation, i{¥)) will 
be referred to as classical polynomials; but in this paper, the term "polynomial" will 
be understood to encompass the non-classical case unless otherwise stated. Clearly 
L induces an isomorphism : Polj^^iV ¥) Poly^^(V^ — > 6(F)). We will take 
advantage of this isomorphism whenever we need to use the multiplicative structure on 
F, since T has no multiplicative structure, save for the fact that it is a Z-module (i.e. 
one can define na when a G T and n G Z). 

Remark 1.5. These notions of polynomials are part of a larger theory of polynomial 
algebra between (filtered) groups that are not necessarily abelian; see Appendix O 

Example 1.6. The map P : F2 ^ T with P(0) := and P(l) := 1/2 is a classical 
polynomial of degree ^ 1. The map Q : F2 — ?■ T with Q{0) = and Q{1) = 1/4 is a 
(non-classical) polynomial of degree ^ 2; note that AiQ = ^ — P. 

We can generalise these examples to higher dimensions. If n ^ 1 is an integer, we let 
L : F2 — )■ Z be the function L{xi, . . . , x„) := . where x 1— )■ |x| is the obvious 

map from F2 to the fundamental domain {0, 1}. This map is not a polynomial (either 
in the classical or non-classical sense); however, the function ^ mod 1 is a classical 
polynomial of degree ^ 1 from F2 to T, j mod 1 is a (non-classical) polynomial of 
degree ^ 2 from Fj to T, and more generally, for any k ^ 0, mod 1 is a (non- 
classical) polynomial of degree ^ k from F2 to T. For further generalisation of these 
examples, see Lemma iLTT ii). (iii) below. 



The relevance of (non-classical) polynomials to the Gowers norms can be seen from the 
easily verified fact that if / : — )■ C is a function on a finite-dimensional vector space 

""^Strictly speaking, "not necessarily classical polynomial" would be a more accurate terminology 
than "non-classical polynomial" . 



FINITE FIELDS IN LOW CHARACTERISTIC 



3 



V that is bounded in magnitude by 1 (thus |/(a;)| ^ 1 for all x G V), and d ^ he 
an integer, then ||/||(7d(y) ^ 1, with equality if and only if / is of the form / = e(P) 
for some polynomial P : — t- T of degree at most d, where e : T — t- C is the standard 
character e{x) := e^'^*^. 

We collect the following standard facts about polynomials, setting V to equal the stan- 
dard finite-dimensional vector space F*^ in order to use coordinates: 

Lemma 1.7 (Basic facts about polynomials). Let ^ = F" for some natural number n, 
G be an additive group, and d be an integer. 



(i) If P : V ^ G is a function and d 0, then P G Po\y^^{V — )■ G) if and only 
if AhP G Poly^^_;^(y — )■ G) for all h & V. In fact, we may replace "for all 
h eV" by "for all h in a set that generates V". 

(ii) If d ^ 0, then a function P : V ¥ is a polynomial of degree ^ d if and only 
if it has a representation of the form 

P{xi, . . . , Xn) = . . x^" (1-1) 

0!£n,...,i„<p:n+...+j„^ii 

for some coefficients Ci^^,,,^i^ G F, and furthermore these coefficients are unique. 

(iii) If d 0, then a function P : V ^ T is a polynomial of degree ^ d if and only 
if it has a representation of the form 

P{x,,...,xn) = a+ Yl ^"-'""'^.^r"' modi (1.2) 

^ ii, . . . , i„ < p; j 5s : 

< ii + . . . + j„ ^ d- j{p~ 1) 

for some coefficients Ci^^,„^i^^^j G {0, 1, ... ,p — 1} and a G T, where x i— )■ |x| is 
the map from F to the fundamental domain {0, 1, . . . ,p — 1} G Z. Furthermore 
the coefficients Ci-^^,,,^i„j and a are unique. 

(iv) If R is a commutative ring, and P,Q : V R are polynomials of degree ^ d 
and ^ d' respectively, then PQ : V R is a polynomial of degree ^ d + d' . 

(v) The map p : P ^ pP is a homomorphism from 

Poly^^(V ^ G) ^ Poly^„,ax{d-p+i,o)(^ ^ G). 

If G = T and d ^ 0, then this homomorphism is surjective. 

(vi) If P E Po\j^^{V — )■ T) and d 0, then there exists a G T such that P 

takes values in the coset a H — of the (pL^J+i^f* roots of unity. In 

particular, P takes on at most p^p-^^ distinct values. 



Proof. See Appendix [Dl □ 

Remark 1.8. If follows from part (vi) that ii d < p then the set of polynomials of 
degree < in T coincides (up to constants) with the set of classical polynomials of 
degree < d. However, this statement is false in the low characteristic case p ^ d. 

Remark 1.9. We isolate one of the claims in the above lemma for special comment, 
namely the surjectivity claim in part (v). This claim implies that every polynomial 
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P of degree ^ d for some d ^ has a p^^ root Q (i.e. pQ = P) which is a (non- 
classical) polynomial of degree ^ d + p — 1. We refer to the ability to take p^^ roots 
while losing exactly a factor of p — 1 in the degree as the exact roots property. The 
exact roots property plays a crucial role in our proof of the inverse conjecture with the 
correct degree of polynomials involved. Unfortunately, this property does not hold in 
the ergodic theory setting, which is one reason why our arguments here do not proceed 
via the ergodic theoretic approach; see Appendix |E] for further discussion. 

We now connect non-classical polynomials to the Gowers norms. From the monotonicity 
of the Gowers norms and the Cauchy-Schwarz-Gowers inequality (see Lemma iB.l( ii). 
(vi)) we see that if / : — > C correlates with a polynomial P G Poly^^^^(V^ — )■ T) in 
the sense that 

K^vf{x)e{-P{x))\;^S 
for some 6 > 0, then we have ||/||c/''+i(y) ^ S. 

The inverse conjecture for the Gowers norm over F is a converse of this statement: 

Conjecture 1.10 (Inverse conjecture GI(s)). Let 6 > and s ^ 0. Then there exists an 
£ = £s,s,¥ > such that for every finite- dimensional vector space V and any 1-bounded 
function f : V ^ C with \\f\\u''+'^iv) ^ ^, there exists P G Poly^5(\^ — j- T) such that 

\E^^vfix)ei-Pix))\^e. 

For a fixed s, we denote the above conjecture as GI(s). We now briefly review the 
history of progress on this conjecture. The case GI(0) is trivial, while the case GI(1) 
follows easily from Plancherel's theorem. The result was established for GI(2) in [18] (for 
odd characteristic) and (for even characteristic), and a formulation of Conjecture 
11.101 was then conjectured in both papers. In that formulation, the polynomial P was 
assumed to be a classical polynomial rather than a non-classical one. In subsequent 
work [22], [S3], it was shown that this "classical" formulation of the conjecture could 
fail in the low characteristic regime p ^ s; however the counterexamples in these papers 
did not prevent the "non-classical" formulation of Conjecture 11.101 given above from 
holding in those cases. 

The case when 6 is sufficiently close to 1 (depending on s) was treated in [T], while 
the case when the characteristic p is large compared to s and 6 was established in 
[5B] . In [23, Conjecture 11.101 was also established in the case when / = e(P) for some 
P e Po\y^p{V — )■ T). Finally, in ^43j, ^J, Conjecture 11.101 was established in the high- 
characteristic case s < p, and a weaker version of this conjecture established in the 
low-characteristic case (see Theorem 111.31) . 

The first main result of this paper is to extend the high characteristic result from [43] . 
[3] to the low characteristic case also: 

Theorem 1.11. GI(s) is true for all choices of¥ and s. 

Remark 1.12. As stated at the beginning of the introduction, we are restricting F to 
be a field of prime order. But finite fields Wpj of prime power order are also covered 
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by this theorem, since any vector space over ¥pj can also be viewed as a vector space 
over Fp, with no change in the definition of the Gowers norm or the definition of a 
(non-classical) polynomial. 

Remark 1.13. In [16], Gowers and Wolf used the high characteristic case of Conjecture 
ll.lOl to compute the true complexity of a system (a concept introduced in [H]) of linear 
equations in sufficiently high characteristic. In principle. Theorem 11.111 would allow the 
"sufficiently high characteristic" condition to be weakened or dropped entirely. However, 
this would require adapting the arguments in [l6j from classical polynomials to non- 
classical polynomials, and furthermore the high characteristic hypothesis is also used 
elsewhere in the arguments (in particular, the characteristic was assumed to exceed the 
"Cauchy-Schwarz complexity" of the system being studied). 

1.14. Rank and analytic rank. Theorem 11.111 is established as a consequence of a 
related result (Theorem 11.201 below), which is the main technical result of the paper. 
Before we can state that result, we first need to recall the useful notions of rank and 
analytic rank, following [T6] . 

Definition 1.15 (Rank and analytic rank). Let s ^ be an integer, and let P G 
Poly^,^i(V^T). 

• The rank rank(P) = ranks(P) of P is the least number m of polynomials 
Qi, . . . ,Qm G Poly<;<)(^ T) of degree ^ s such that P is a function of 
Qi, ■ ■ ■ , Qm, i-e. there exists a function F : T™ — )■ T such that P = F{Qi, . . . , Qm) 
(We adopt the convention that ranko(P) is infinite if the linear polynomial P 
is non-constant.) 

• The analytic rank arank(P) = aranks(P) of P is defined to be the quantity 
arank(P) := - logp ||e(P)|| J/.+i^^^. 

We define the rank and analytic rank of polynomials P G Poly<;^_,_i(y — ?■ F) by using 
the homomorphism t^,, thus for instance arank(P) := arank(i,,P). (In particular, when 
defining the rank of a classical polynomial P, we allow for P to be represented by 
non-classical polynomials Qi, . . . , Qm of the required degree.) 

The analytic rank is closely related to the derivative d'^^^P : — )■ T of P, defined as 

d'+'P{h, . . . , K+i) := A^, . . . Ah,^,P{x) (1.3) 

for any hi, ... , hs+i,x G V (note that the right-hand side is independent of x when P 
has degree at most s + 1). Indeed, a short calculation shows that 

E,„...,,,^,eye(d^+ip(/ii, . . . , /i.+i)) = p---'^(^). (1.4) 

Thus for instance arank(P) vanishes if and only if P is of degree ^ s. 

For a classical quadratic form Q : V ¥, the rank rank(Q) = ranki((5) and the analytic 
rank arank(Q) = aranki(Q) are both equal (at least when p is odd) to the usual concept 
of the rank of a quadratic form in linear algebra; see [L6l for further discussion. 
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Example 1.16. For each k ^ 1, let 5*^ : F2 — )■ F2 be the degree k classical symmetric 
polynomial 

^ki,-^ly ■ ■ ■ y -^n) ■ ^ ^ -^ii ' ' ' -^ij. 

or equivalently, using the notation L{xi, . . . , x„) := |a;i| + . . . + |a;„| from Example 11.6^ 
we have 

Sk = mod 2. 

A classical theorem of Lucas (reflecting the self-similar fractal nature of Pascal's triangle 
modulo 2) then gives the identity 

whenever k = 2°"^ + . . . + 2^*™ is the binary expansion of k\ thus for instance 5*3 = S2S1, 
5*5 = 5'45'i, 5*6 = 5'45'2, 5*7 = 8^8281, etc. An easy induction on m then shows that each 
S*^'" is a binary coefficient of L, or more precisely that 



L = ^\82^\T^ (1.5) 

m=0 

(note that only finitely many of the summands are non-zero). 
In [33], [22] it was computed that 

l|e(i(54))||^4(Fn) = \ + 

as n — )■ 00, and thus arank(S'4) = arank3(S'4) = 3 -|- o(l). As for the rank of 5*4, it 
can be shown that as n — )• 00, 5*4 cannot be expressed as a function of a bounded 
number of classical cubics; see [33], [22]. However, from (11. 5p we see that ^4 is a 
function of the expression ^ mod 1, which is a cubic by Lemma fl.7( iii). We conclude 
that rank(S'4) = rank3(S'4) = 1. 



We collect here some basic observations regarding rank and analytic rank: 

Lemma 1.17 (Basic properties of rank). [16] Let s ^ Q he an integer, and let P,Q & 
Poly ^g^iiy — 7- T) be polynomials. 

(i) rank(P) = rank(— P) and arank(P) = arank(— P). 

(n) rank(P + Q) ^ rank(P) + rank(Q) and arank(P + Q) ^ 2*+i(arank(P) + 
arank((5)). 

(iii) arank(P) ^ Csrank(P), for some constant Cg > depending only on s. 

(iv) We have the inequality 

s+l 

|E,„...,,,^,eye(rf^+ip(/ii, . . . , /i,+i)) n fjih, . . . , /i,+i)| ^ k(P)/2^ 

whenever fj : — )■ C are 1-bounded functions, with each fj{hi,...,hs^i) 
independent of the hj variable. 
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Proof. The claim (i) is trivial, as is the first part of claim (ii). The second part of (ii) 
is established in [151 Lemma 5.9]. Now we turn to (iii). If we write rank(P) = m, 
then P is a function of poljTiomials Qi, . . . , Qm of degree ^ s. By Lemma ll.7( vi). we 
may assume that the Qi, . . . , Qm all take values in the (p'")*'^ roots of unity for some 
C = C{s). By Fourier analysis, we may thus decompose 

e(P) = ^ Ca^,...,a^e{aiQi + . . . + a^Qm) 

where the Fourier coefficients Ca^^...^a,n are bounded in magnitude by 1 (in fact their £^ 
norm is bounded by 1). By the pigeonhole principle, one can thus find such 
that 

|E,eye(P(x))e(-aigi(x) - ... - a^Q^(x))| ^ p-'"^ 
which by the monotonicity of Gowers norms implies that 

\\e{P)e{-aiQi - ... - amQm)\\u''+^{v) ^ p"""^- 

Since the U^^^iV) norm is invariant with respect to modulation by polynomials of 
degree ^ s, we conclude that 

and thus arank(P) ^ 2^Cm, and the claim follows. 

Finally, the claim (iv) is established in fiQ\ Lemma 5.4]. □ 



Lemma ri.l7( iii) asserts that the rank controls the analytic rank. For each s ^ 0, we let 
GIP(s) denote the following converse: 

Conjecture 1.18 (Inverse conjecture GIP(s) for polynomials). Let 6 > 0. Then there 
exists an integer K = Ks,s,¥ > such that for every finite-dimensional vector space 
V and any Q G Poly^^^i(\^ — t- C) with ||e((5) ^ ^, there exists Pi,...,Pk G 
Poly^^(\^ — > T) such that Q is a function of Pi, ... , Pk (i.e. Q = P(Pi, • • • , Pr) for 
some function F : — )■ T. 

Remark 1.19. This type of conjecture was introduced by Bogdanov and Viola [S]. The 
case GIP(O) is trivial, and the case GIP(l) follows from Fourier analysis. In the high 
characteristic case p > s + 1, the result was established in [221 Proposition 6.1]. 

Our main technical theorem is then 

Theorem 1.20. GIP(s) is true for all choices of¥ and s. 



Theorem 11.111 can be deduced from Theorem 11.201 and the "weak" form of the inverse 
conjecture for the Gowers norm established in [13], [3], together with an argument from 
[T6] : we give this (standard) argument in Section [TTl 

We remark that a different approach using ultrafilters to the structural theory of the 
Gowers norms is in the process of being carried out in [27], [M], [H]- 



8 



TERENCE TAG AND TAMAR ZIEGLER 



We are indebted to the anonymous referee for a careful reading of the paper and many 
useful suggestions. 



2. An outline of the argument 

In this section we give an informal outline of how we will prove the inverse conjecture 
for polynomials GIP(s) (and hence the full inverse conjecture GI(s)), suppressing many 
technical details (for instance, we will leave terms such as "bounded" vague for now, 
but such concepts will be made rigorous shortly with the assistance of nonstandard 
analysis). We will also identify F with l(¥) = ^Z/Z for the purposes of this discussion. 

As with many other arguments in this subject, we will induct on the degree parameter 
s, and assume that the conjecture GIP(s') has already been proven for all s' < s. For 
sake of exposition we will work in the low characteristic case p ^ s, which is the hardest 
case. 

Informally, the induction hypothesis allows one to obtain a completely satisfactory 
equidistribution theory for all (non-classical) polynomials of degree less than or equal 
to s, at least in principle. For instance, if P : — )■ T is a polynomial of degree s' + 1 for 
some s' < s that is of high rank, one can use GIP(s') to describe the equidistribution 
of the tuples {P{x + hicoi + . . . + hdCOd))oji,...,u}a€{o,i} ^ T^'^'^^ for fixed d, as x,hi, . . . , hd 
ranges uniformly over V . The precise equidistribution results we will need are rather 
technical to state, and will be formalised in Lemma 16.41 and Lemma 19.81 

The conjecture GIP(s) asserts, informally, that any non-classical polynomial of degree 
^ s + 1 of bounded analytic rank, also has bounded rank. This conjecture can be 
established from three sub-claims, which we informally state as follows: 

(i) (Multiplication by p) If P is a non-classical polynomial of degree ^ s + 1 
of bounded analytic rank, then pP is a non-classical polynomial of degree ^ 
s — p + 2 of bounded analytic rank. 

(ii) (Division by p) If Q is a non-classical polynomial of degree ^ s — p + 2 oi 
bounded rank, then there exists a non-classical polynomial P' of degree ^ s + 1 
of bounded rank such that pP' = Q. 

(iii) (Classical case) If P : V" — )■ F is a classical polynomial of degree ^ s + 1 of 
bounded analytic rank, then there exists a classical polynomial Q : V ^ ¥ of 
degree ^ s + 1 and bounded rank such that d^~^^P = d^^^Q, where the top 
order derivative d^^^P : V^^^ — t- F of P was defined in (11. 3p . (In other words, 
P and Q differ by a polynomial of degree strictly less than s + 1.) 

Indeed, assume these three claims hold. Then if P has bounded analytic rank, then by 
Claim (i), pP has bounded analytic rank, and hence is of bounded rank by induction 
hypothesis. By Claim (ii), we can thus find a polynomial P' of bounded rank (and 
therefore also of bounded analytic rank) such that pP' = pP, so that P — P' is classical 
while still having bounded analytic rank. By Claim (iii), P — P' differs from a classical 
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bounded rank polynomial Q by a polynomial of degree strictly less than s + 1, and so 
P has bounded rank also, as required. 

It remains to verify the three claims. After performing a Fourier expansion, Claim 
(i) will be an easy consequence of the multidimensional Szemeredi theorem for vector 
spaces, first proven by Bergelson, Leibman, and McCutcheon [4j; we will establish it in 
Section O 

We defer discussion of Claim (ii) for now, and move on to Claim (iii), which is somewhat 
easier to establish. The strategy here is to obtain as much structural information on the 
expression d^^^P as possible, so that the bounded rank polynomial Q can be constructed 
more or less explicitly. It is easy to see that d'^^^P : — )■ F is a symmetric multilinear 
form. The fact that P is classical gives an additional cancellation property, namely that 
d'^~^^P{hi, . . . , hs+i) necessarily vanishes whenever at least p of the hi, ... , hg+i are equal 
to each other. For instance, if p = 2 and s + 1 = 4, we have 

d'+^P{a,a,h,c) = 0, 

as can be deduced from the identity 

AaAaP = {T2a ' 2Ta + 1)P = 2P - T,(2P) = 

since one has 2P = when P is classical. We will refer to symmetric multilinear forms 
with this cancellation property as classical symmetric multilinear forms. 

The fact that P has bounded analytic rank implies (from (11.41) ) that the form d^^^P is 
biased, in the sense that Kh^^___^hs+iev€-{d'^~^^P{hi, . . . , hs+i)) is large. To exploit this, we 
use a general equidistribution result of Kaufman and Lovett[28] to conclude that d^~^^P 
must be expressible in terms of lower degree classical symmetric multilinear forms. We 
then apply a standard "regularity lemma" (analogous to those in [22], [2H]) to make these 
forms "independent" of each other, which makes them jointly equidistributed in a certain 
technical sense (see Lemma [72] for a precise statement). With this equidistribution, one 
can control the precise manner in which d^^^P is a function of the lower degree forms, 
and we will end up showing that d'^^^P is a certain symmetrised combination of such 
forms (on a bounded index subspace). This will be made more precise in Section [6l but 
a typical example occurs when s + 1 = 4, in which the expression 

d'^P{a, b, c, d) 

will be expressed as a linear combination of terms such as 

B{a, b)B{c, d) + B{a, c)B{b, d) + B{a, d)B{b, c) (2.1) 
where B : V x V is a. (classical) symmetric bilinear form. 

To conclude Claim (iii), we thus need to rewrite expressions such as (12. ip in the form 
d^~^^Q, where Q is of bounded rank. For sake of argument let us work specifically with 
the example (12. ip . As B is itself classical, it can be expressed as -B = d^R for some 
classical quadratic polynomial R : V In the high characteristic case p > 2, one can 

then proceed simply by setting Q := i?^/2!, as the claim can be verified from the discrete 
Leibniz rule flD.2p . However one cannot proceed so easily in the low characteristic case 
p = 2, as one can no longer divide by 2! in this case. Instead, we lift the polynomial 
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R (which takes values in F = Z/2Z) to the larger cyclic group Z/4Z, obtaining a 
cubic polynomial R : V ^ Z/4Z which projects back down to R in the sense that 
R = R mod 2. We then set Q := (2) mod 2 (instead of -R^/2!); the point is that the 
sequence n i— )■ (2) mod 2 is periodic with period 4 and so this expression is well defined. 
One can show that Q is a quartic polynomial, which is clearly of bounded rank as it 
depends only on the cubic polynomial R, and one can also compute that 

d^g(a, b, c, d) = B{a, b)B{c, d) + S(a, c)S(6, d) + 5(a, d)B{h, c) 

which gives the desired representation of (12. ip . The same arguments work in more 
general degrees and characteristics to give Claim (iii) in general; see Section [61 Note 
that these constructions rely heavily on the classical nature of i?, and hence of B and 
d^P, and ultimately exploit the multiplicative structure of the classical range F that is 
not present in the non-classical range M/Z. 

Finally, we return to Claim (ii). Let Q be a non-classical polynomial of degree ^ s—p+2 
of bounded rank, thus Q is some combination of polynomials of degree strictly less than 
s—p + 2. By a "regularity lemma" argument, it will turn out to be possible to express Q 
as the combination of "independent" polynomials Pi, . . . , of various degrees between 
1 and s — p + 2., with each Pj taking values in some cyclic group Z/p-^^Z, thus 

Q = P(Pi,...,P,) 

for some function F : {'L/p''^'L) x . . . x {%jp'^^%) — )■ T. The precise nature of this 
independence is somewhat technical to state, but it implies good joint equidistribution 
properties on the Pi, . . . , P^ (see Proposition 19.81 for a formal statement of this). The 
fact that Q has degree ^ s — p + 2 will imply that a suitable "weighted degree" of F is 
also at most ^ s — p + 2. 

For technical reasons, the polynomials Pj of degree 1 will cause some difficulty (their 
p^^ roots will not have high rank). But one can trivially eliminate all such polynomials 
by passing to a finite index subspace on which these polynomials are constant. As such, 
one can easily reduce to the case where all polynomials Pj have degree at least 2. 

It would be convenient if we could then find another function G : (Z/p'^iZ) x . . . x 
(Z/p'^'^Z) — )■ T of "weighted degree" ^ s + 1 such that pG = F, as the function P' := 
G{Pi, . . . , Pfc) would then obey the necessary requirements for Claim 2. Unfortunately, 
this claim turns out to be false in general. However, what one can do is first use the 
exact roots property from Lemma ILTt v) to obtain a p^^ root Pj for each Pj with the 
"right" degree, thus pPj = Pj. Because the Pj have degree at least 2, it turns out that 
the independence properties of the Pj are inherited by the Pj. One can then rewrite Q 
as Q = F'{P[, . . . , Pfe) where F' : (Z/p-^^+^Z) x . . . x (Z/p-^^'+^Z) ^ T is the puUback 
of F. By analysing the concept of weighted degree for periodic functions on Z*^ (and in 
particular by breaking such functions down into multinomials), we will be able to find a 
function G' : (Z/p'^^+'^Z) x . . . x (Z/p'^^+'^Z) — T of the right weighted degree such that 
pG' = P, and then the function P' := G'{P[, . . . , P^) will obey the properties required 
for Claim (ii). 

The detailed proof of Claim (ii) will occupy Sections lUlfTUl 
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3. Taking ultralimits 

To prove Theorem 11.201 it will be convenient to pass from the finitary setting to an 
infinitary one, in order to eliminate a profusion of epsilons, deltas, and growth functions 
(such as the growth functions T that appear for instance in [22], [28]); it also allows us 
to conveniently make rigorous such phrases as "the polynomials Pi, ... , Pm are linearly 
independent modulo bounded rank errors", which would otherwise only make sense 
heuristically or would need to be quantified with additional parameters. In [13] the 
Furstenberg correspondence principle was used to convert the inverse conjecture to a 
statement in an infinitary branch of mathematics, namely ergodic theory. Unfortunately, 
the ergodic theory framework has a drawback in the low characteristic setting, namely 
that there does not appear to be an easy way to take roots of polynomials in this setting 
in a degree-efficient manner (see Appendix [E]) . To overcome this technical obstacle we 
shall take a different infinitary formulation of the problem, namely an ultralimit (or 
nonstandard analysis) formulation. (Such formulations have also appeared in other 
recent work on the inverse conjecture [21], [37].) 

The basic machinery of ultralimits and nonstandard analysis is recalled in Appendix 
Rl as is the asymptotic notation (such as X ^ F or X = 0(Y)) associated with this 
machinery. 

We now translate all of the terminology used for Theorem ll.20l to the ultralimit setting. 

Let V he a non-empty limit finite set (i.e. an ultralimit V = limn_5.Q, Vn of standard 
non-empty finite sets Vn) and let / : ^ — )■ *C be a limit function on V (thus / is an 
ultralimit / = limn-s>a /n of standard functions fn '■ Vn C). Then we can define the 
expectation K^^zyfix) of f on V in the usual fashion by the formula 

^xGvfix) := lim E^.^gv'n/n(a;n)- 

This will be a limit complex number. If further ^ is a limit finite-dimensional vector 
space (i.e. each Vn is a standard finite-dimensional space), and ci ^ 1 is a standard 
natural number, then we can similarly define the uniformity norm 

•= liin ||/n||i/d(y„) 

which will be a limit non-negative real number. 

Given a limit finite-dimensional space V and a standard integer G Z, we let Poly^^(V — ?■ 
*T) denote the space of all limit polynomials P of degree ^ d from V to *T, i.e. all ultra- 
limits P = limn^Q Pn of polynomials Pn G Poly<;^(K -> T). We define Fo\y^^{V -> F) 
and Po\j^^{V '-(F)) for limit finite-dimensional V similarly. 

We make the simple but important remark that all the claims in Lemma II. 7[ that were 
established for finite-dimensional vector spaces, extend to the limit finite-dimensional 
setting (replacing all operations by their limit counterparts) by taking ultralimits. In 
particular, the surjectivity claim in Lemma [TTTT v) extends to the limit finite-dimensional 
setting. 
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If s ^ is standard and P G Poly<;^_,_i(y — )■ *T), we can define the rank, rank(P) = 
ranks (P), and analytic rank, arank(P) = aranks(P), which are now non- negative hmit 
integers and non-negative hmit real numbers respectively. We let Poly^^^^ br(^ ^ *'^) 
and Poly^s+i bar(^ ~^ denote the limit polynomials P G Poly^s+i(y *T) of 
degree < s + 1 that are of bounded rank and bounded analytic rank respectively. From 
Lemma 11.171 we see that Poly^js+i.BRl^ ~^ *'^) Poly^g_,_i bar(^ ~^ *'^) vector 
spaces with the inclusions 

Poly^,(V ^ *T) ^ Poly^,+i,BR(V ^ *T) ^ Poly^,+i,BAR(^ ^ *V ^ Po\y^,^,iV ^ *T). 
We similarly define Poly^,+i_BR(^ -> ^) and Poly^,+i^BAR(^ 

We can now give the ultralimit formulation of GIP(s): 

Theorem 3.1 (Ultralimit equivalence of GIP(s)). Let s ^ be standard. Then GIP(s) 
holds if and only if, for every limit finite- dimensional vector space V , Polj^g_^_i br(^ ^ 
*T) = Poly<js^i bar(^ ^ *^) bounded analytic rank and bounded rank are equiva- 
lent). 

Proof. We first assume GIP(s) and verify that Poly^^+i br(^ ^ *'^) = PoIYscs+i.barI^ ^ 
*T). Accordingly, we let P G Poly<j5_|_i bar(^ ~^ and need to show that P has 
bounded rank. 

By construction, we can write V = rin-^Q: as the ultraproduct of finite-dimensional 
spaces Vn, and similarly write P = limn-i^a Pn as the ultralimit of polynomials Pn : Ki — ^ 
T. Since P has bounded analytic rank, the Pn have bounded analytic rank uniformly in 
n (at least for n sufficiently close to a). Applying GIP(s), we conclude that the Pn have 
bounded rank uniformly in n, thus we can find polynomials Qn,i, • • • , Qn,K : Ki — ^ T of 
degree ^ s and a function Pn : T-^^ — ^ T such that Pn = Pn(<5n,i, • • • , Qn,K)- Writing 
Qi := hmn-!.aQn,i and F := limn^aPn, we conclude that Qi, ■ ■ ■ ,Qk : V *T are 
limit polynomials of degree ^ s and P = F{Qi, . . . , Qk), and so P has bounded rank 
as desired. 

Conversely, suppose that Poly^^.,.]^ br(^ ~^ *'^) = PoIYsss+i bar(^ ~^ *'^)- We as- 
sume for contradiction that GIP(s) failed. Thus, there exists a sequence Ki of finite- 
dimensional vector spaces and polynomials Pn : Ki T of degree ^ s + 1 whose analytic 
rank is bounded uniformly in n, but whose rank goes to infinity as n — )• oo. Setting 
V := Hn-s^a ^ and P := limn^a-Pn, we see that P : V — )• *T is a limit polynomial of 
degree ^ s + 1 of bounded analytic rank, and hence of bounded rank by hypothesis, thus 
P = F{Qi, . . . , Qk) for some bounded m, some limit polynomials Qi, . . . , Qk : V *T 
of degree ^ s, and some function F : — )■ *T. Writing Qi := limn^a Qn^i and 
F := limn-s.aPn, we conclude that Pn = Pn(Qn,i, • • • ,Qn,m) for all n sufficiently close 
to a, and thus the rank of Pn is bounded uniformly for such n, which gives the desired 
contradiction. □ 

A similar argument (which we omit) gives the ultralimit formulation of GI(s): 

Theorem 3.2. Let s ^ be standard. Then Gl{s) holds if and only if, for every limit 
finite- dimensional vector space V and every bounded limit function f : V ^ *C (where 
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"bounded" means that sup^^y\f{x)\ is bounded), \\f\\u=+'^{v) ^ 1 if and only if there 
exists P e Poly^,(l^ -> *T) such that |E^6v/(a;)e(-P(x))| > 1. 

It remains to establish that Poly<jj,+i br(^ ~^ *"^) = PoIy^s+i.barI^ This is the 

purpose of the remaining sections of the paper. 

4. Splitting into three subclaims 

To prove the claim Poly^,+i br(^ ^ *T) = Poly^,+i^BAR(^ ^ *T) (and hence GIP(s)), 
we split this theorem into three subclaims as outlined in Section |2l Claim (i) is easily 
formalised: 

Theorem 4.1 (Multiplication by p). Let V be a limit finite- dimensional vector space. 
If k > p is a standard integer, then the map P t— ?■ pP maps Poly<;j BAhi^ ~^ *'^) 

Poly^,-p+i,BAi?(^^*T). 

We prove this theorem in Section |3 Claim (ii) is also easily formalised, though for 
technical reasons (having to do with the need to eliminate all linear polynomials that 
arise in the regularity lemma) it is convenient to weaken the claim to a "virtual" version 
in which one only obtains roots on a bounded index subspace: 

Theorem 4.2 (Exact roots). Let s ^ 1 be a standard integer such that GIP(s') is true 
for all ^ s' ^ s . Then for every ^ s' ^ s, every limit finite-dimensional V , and 
every P G Poly^s/_,_i br(^ ^ there exists a bounded index (limit) subspace V ofV 
and Q e Poly^^z+p br(^' *T) such that pQ = P on V . 

We will prove this theorem in Sections [QlfTOl 

To formalise Claim (iii) properly, we will need some additional notation. 

Definition 4.3 (Multilinear maps). Let \^ be a limit finite-dimensional vector space 
and k ^ 1. 

• A k-linear map is a limit map T : V'' W such that if one fixes all but 
one variable hi of the k variables hi,...,hk G V, the remaining map hi i— )■ 
T{hi, . . . , hk) is linear. 

• A fc-linear map is symmetric if it is invariant under permutations of the k 
variables hi, . . . ,hk. 

• A symmetric fc-linear map is classical if T{hi, . . . , hk) vanishes whenever at 
least p of the hi, . . . ,hk agree (this condition is of course vacuous for k < p). 
Thus, for instance, if p = 2 and = 3, a symmetric trilinear form T is classical 
if T(a, a, 6) = for all a,h & V . 

We abbreviate "classical symmetric multilinear" as CSM. We denote the space of clas- 
sical symmetric /c-linear maps T : l^'^ ^ F as CSM'=(1^); this is clearly a vector space. 
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Example 4.4. Let V = ¥'^ and k = 2. The map 

T{h, h') := hih'j 

where h = {hi, . . . , h' = {h[, . . . , h'^) is muhihnear, but not symmetric or classical. 
The map 

T'{h,h'):= J2 ^^^'3 
is symmetric and multilinear, but not classical. The map 

T"{h,h'):= Yl ^4 

is symmetric, multilinear, and classical, and thus lies in CSM^(\^). 

Multilinear maps are naturally associated to derivatives of polynomials. Indeed, from 
the cocycle equation 

Ah+h' = Ah + Ah' + AhAh' 
and the commutativity identity 

AhAh' = Ah' Ah 

we see that for any P G Poly^j^{V — )■ *T), d'^P is multilinear and symmetric. In 
particular, from multilinearity d'^P must take values in 

From Lemma ID. 31 we see that A^ and pAh differ (multiplicatively) by an invertible 
formal differential operator, which is equal to —1 plus higher order terms. Applying 
this to a polynomial P G Poly^^(\^ — j- *T) with k ^ p, we conclude the identity 

d'^Pih, ...,huh2,..., hk-p+l) = -d'-P+\pP){h, h2,..., hu-p+l) (4.1) 

for any hi, ... , h^-p+i G V, where hi appears p times on the left-hand side (recall that 
pP is of degree ^ — p + 1, by Lemma [1.71) . 

This identity has a number of consequences. For instance, it gives some additional 
constraints on d''P beyond symmetry, such as 

d'^P{hi, . . . , hi, h2, . . . , /ifc-p+i) = d^P{hi, /12, . . . , h2, h^, . . . , /ife-p+i), 

where p copies of hi appear on the left, and p copies of /i2 appear on the right. 

Another consequence will be important for us: 

Lemma 4.5 (Derivative of classical polynomials). Let k ^ 1 and V be a limit finite- 
dimensional vector space, then d maps 

Folj^kiV ^) io CSM^{V ¥), and further- 
more this map is surjective. In other words, we have the short exact sequence 

Poly^fc_i(y ^ F) ^ Poly^kiV ^ ¥) ^ CSM\V) 0. 

To state the above lemma loosely, classical symmetric multilinear forms are nothing 
more than the derivatives of classical polynomials. 
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Proof. Applying (14. ip to a classical polynomial P G Poly^^(y — )■ F), we have pP = 0, 
and thus d'^P is classical, which gives the first claim. 

Conversely, suppose that T e CSM*'(l^). It suffices to verify the claim when V is finite 
dimensional, as the limit finite-dimensional case then follows by taking ultralimits. We 
take advantage of the finite-dimensionality to write = F" (without loss of generality), 
then the multilinear form T can be expressed in coordinates as 




where hi^i, . . . , hi^n ^ are the coordinates of h^, and Ci-^^,,,^i^ G F are coefficients. From 
the symmetric nature of T we know that the Ci^^...^i^ are symmetric with respect to 
permutations of the indices ii, . . . ,ik', from the classical nature of T we know that the 
Cii,...,ifc vanish whenever p or more of the ij are equal. We thus see that T is an integer 
linear combination of expressions of the form 

^ I J2 ^I'^i • • • I 

\{ii,...,ik}=A f 

where {ii, . . . , i^} is the multiset formed by ii, . . . , z^, and A is a multiset of k elements 
taking values in {1, . . . with the multiplicity of each 1 ^ j ^ n in A being less 
than p. A short computation then shows that each such expression can be expressed as 
^fcp for a polynomial P G VcA^^^iy — )■ F), indeed we may take 

Note how the multiplicity bound aj < p allows for the factorial ajl to be inverted in F. 

(One could also use the binomial coefficient (^^) in place of here if desired.) The 
claim now follows from linearity of d''. □ 



Claim (iii) will be a variant of Lemma [4.51 in the biased case: 

Theorem 4.6 (Inverse Cowers for classical symmetric multilinear forms). Let s ^ 2 he 
such that CIP(s') is true for all ^ s' < s, let V he limit finite-dimensional, and let 
T G CSM'+^(y F) he such that |E/,,,...,/,^^^6ye(6(T(/ii, . . . , hs+i)))\ > 1. Then there 
exists a hounded index (limit) suhspace V ofV and P G Poly^^_,_;^ br(^' ^ ■^'^^/i that 
d'+^P = T on {V'y+\ 

This theorem will be proven in Sections |6]|8l To state the theorem loosely, classical 
hiased symmetric multilinear forms are nothing more than the derivatives of classical 
bounded rank polynomials. 

In the remainder of this section, we show how Theorems 14. 1^ H72| 14.61 imply CIP(s). We 
first need a technical lemma to handle the passage to bounded index limit subspaces. 
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Lemma 4.7. Let k ^ 2 and P G Poly^^(y — )■ *T), and let V be a bounded index 
subspace of V . (Note that such subspaces are automatically limit subspaces.) Let P' G 
Poly^fc(V *T) be the restriction of P to V . 

(i) P has bounded analytic rank if and only if P' has bounded analytic rank. 

(ii) P has bounded rank if and only if P' has bounded rank. 

If k = 1, then the "only if" portions of the claim continue to hold. 

Proof. We begin with (i). If P' has bounded analytic rank, then by (11 ■4p 

\Eh,,...,h^evlv'{.hi) . . . lv'{hk)e{d^P{hi, hk))\ > 1. 
Applying Lemma I1.17( iv) we conclude that P has bounded analytic rank, as required. 

Conversely, suppose that P has bounded analytic rank, then by fll.4p we have 

Eh,,...,h,Gve{d''P{hi,...,hk)) > 1. 

Applying Fourier analysis, we conclude that for ^ jT^I'^"^ k — 1-tuples {hi, . . . , hk-i) 
in V'^'^, one has d^P{hi, . . . , hk-i, ■) = 0. Applying the pigeonhole principle, one can 
restrict hk-i to a single coset of V' and still have the above claim. Using multilinearity 
in one can in fact restrict to V' . Iterating this argument one may restrict all 
of /ii, . . . , hk_i to V . Using Fourier analysis we conclude that 

Eh,,...M^v'e{d^P'{hi,...,hk)) > 1, 

and hence by f lL4p . P has bounded analytic rank as desired. 

Now we show (ii). The "only if" portion is trivial, so we focus on the "if" part. By 
induction we may assume that V is a hyperplane in V , and we may then write V = V x 
¥ without loss of generality. Suppose that P' is of bounded rank. Letting n : V V he 
the coordinate projection from V x¥ to V, it is easy to verify that P' o vr is a bounded 
rank polynomial of degree ^ k. Subtracting this from P, we may assume without loss 
of generality that P vanishes on V'. 

On each coset of h + V of V , P is of degree ^ k — 1 (since AhP is of degree ^ k — 1). 
Thus Ih+v'P is a function of finitely many polynomials of degree ^ k — 1. Summing 
over a set of coset representatives h we obtain the claim. □ 

Now we can prove GIP(s) assuming Theorems 14. ![ ^^21 14.61 

Proof of GIP (s) . By induction we may assume that GIP(s') holds for all ^ s' < s; 
our task is now to show that GIP(s) holds. We will assume s ^ 2, as the s = 0, 1 cases 
are well known. 

Let P G Poly^s+i bar(^ ~^ *'^)- By Theorem 13. it will suffice to show that P has 
bounded rank. 
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Let us first suppose that s + 1 > p (which is the most difficult case). By Theorem 
El pP G Poly<;^_p+2,BAR(^ -> so by GIP(s - p + 1), pP is of bounded rank. 
By Theorem 14.21 (with s replaced by s — 1), there thus exists Q G Poly^;^.,.! br(^' ~^ 
*T) C Poly^^_,_]^ bar(^' ~^ *'^) such that pP = pQ on a bounded index subspace V of 
V. In particular, P — Q takes values in l{W) on V, so by Lemma [4.51 d^~^^{P — Q) G 
CSM'+^(r ^ *T). 

By Lemma I^Tl P. Q both lie in Poly^^^^ bar(^ ~^ *'^)) ^^ud so P — Q does also. Thus 



Applying Theorem 14.61 (pulling back by l) we conclude that d^~^^{P — Q) = d'^^^{W) on 
^^'/^s+i some G Poly^^+i BRl^" ~^ '•(^^)) some bounded index subspace V" 
of V, thus P — Q — ly is a polynomial of degree ^ s + 1 on V fl V". Since Q, W are 
of bounded rank on V (1 V", we conclude that P is of bounded rank on V (1 V", and 
hence on V by Lemma 14.71 

The case s < p is similar. Here, pP G Poly^]^(\^ — *T), hence on passing to a subspace 
pP is constant, and in particular we can find a constant Q such that pP = pQ on this 
subspace. One then argues as before. □ 

It remains to prove Theorems 14. ![ 14. 2[ 14.61 This is the purpose of the remaining sections 
of the paper. 



In this section we prove Theorem 1 4.11 The main tool will be the multidimensional 
Szemeredi theorem over finite fieldqj from [4J , which we can formulate as follows: 

Proposition 5.1 (Multidimensional Szemeredi theorem). Let V be a limit finite- dimensional 
vector space, let k ^ 1 be a standard integer, and let A G be such that \A\ ^ \V\''. 
Then there exist ^ |\^|'^"^^ tuples {hi, . . . , hk, h) & V such that 



Proof. We combine the results from [4j with an averaging argument of Varnavides . 

The claim is trivial if V has bounded dimension, so we may assume that V has un- 
bounded dimension. 

Since \A\ ^ IV]'', if we write A = limn^^ and V = limn^.a V^, then there is a 
standard 6 > such that \A,^\ ^ (5|V^n|^ for all n sufficiently close to a. 

Fix n with this property. Let M be a large standard integer depending on k, 6 to be 
chosen later. As V has unbounded dimension, we may assume that has dimension 
at least M by taking n sufficiently close to a. 



^h,,...,h.^,^ve{d'^\P - Q){hi, . . . , K+i)) » 1. 



5. Multiplying by p 




{{hi + hui, h2 + hu2, . . . ,hk + huk) : oJi, . . . , G F} C A. 



'It is also possible to prove this proposition using the density Hales- Jewett theorem[5] instead. 
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Let Wn be an arbitrary linear subspace of of dimension M. We can then foliate 
into (iFnI/lWnl)'' cosets x + W^ of W^. The average value of {A^ n {x + W^)\/\Wn\^ 
in these cosets is at least 6. Thus, for at least 5(|Ki|/| W^n|)'^/2 of these cosets, we have 
\A^n{x + W^)\/\W^\''^6/2. 

Consider one of these cosets x + with fl (a; + W^n)l/I^n|'' ^ S/2. Applying [H 
Corollary 5.4], we see that if M is large enough depending on k,6, we can thus find a 
non-zero h G Wn and {hi, . . . , hk) E x + such that 

{{hi + hui, h2 + hu2, . . . ,hk + huk) : Wi, . . . , Wfe G F} C A^- (5.1) 

Summing over all such cosets (and using the pigeonhole principle to fix h), we can thus 
find a non-zero h G such that (15. ip holds for at least Ck,M,s\Vn\'' tuples {hi, ... , hk) G 
V^, where Ck,M,5 > is standard. Averaging over all possible linear subspaces H^n of 
V-a of dimension M, we conclude from a routine double counting argument that in fact 
(15. ip holds for at least c'^ msIKiI'^'''^ tuples {h,hi, . . . ,hk) G V^^^, where c'f. ^ > is 
standard. Taking ultralimits as n — )■ a we obtain the claim. □ 

Now we can prove Theorem 14.11 



Proof of Theorem \4-l\ Let P G Poly^;j_^^^(\^ — )■ *T). From fll.4p one has 

Ef,,_f,^^ve{d'P{hi,...,hk)):^l. (5.2) 

For each tuple {hi, . . . , hk-i) G V''~^, the function e{d''P{hi, . . . , h^-i, ■)) is a character 
on V. Thus, by Fourier analysis, for (15. 2 p to hold we must have ^ tuples 
{hi, . . . , hk-i) G V''^^, for which the form 



d^P{hi,...,h 



k 



vanishes identically. Applying Proposition l5.H we can thus find ^ |V^| tuples {h,hi, . . . ,hk 
such that 

d^P{hi + uih, ...,hp + Uph, hp+i, h^-i, ■) 

vanishes identically for all {ui, . . . ,Up) G {0, 1}^. (Note how the hypothesis k > p was 
needed here in order for this expression to make sense.) Taking an alternating sum of 
these expressions, we thus have 

d^P{h, . . . ,h, hpj^i, . . . , hk-i, ■) 

vanishing identically for ^ \V\'' tuples {h, hi, ... , hk-i) G V'', and thus for ^ | V"|^^^+^ 
tuples {h, hp, ... , hk-i) G V'^-p+i. Once again this implies that 

Eh,hp+i,...,hfcGve(rf''P(/i, ...,h, hp+i, hk-i, hk)) > 1, 

and the claim follows from (II. 4p . and (14. ip . □ 

For future use, we record an immediate corollary of Theorem 14.11 

Corollary 5.2. Suppose that k > p and GIP(/c — p) holds. Then the map p : P ^ pP 

maps Poly^,,BR(^ ^ *T) to Voly^k-p+iM^ ^ *T). 
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We will use Corollary 15.21 in the following contrapositive sense: if k > p, GIP(A; — p) 
holds, and P G Poly^^(\^ — j- *T) is such that pP has unbounded rank (as a polynomial 
of degree ^ k — p + 1), then P has unbounded rank (as a polynomial of degree ^ k). 
Informally, roots of high rank nonlinear polynomials remain high rank. 

We remark that Theorem 14. II (and Corollary 15.21) fail when k = p. For instance, if p = 2 
and V = ¥2, then the function P{x) := \x\/4 mod 1 is a bounded rank polynomial of 
degree 2, but 2P{x) = \x\/2 mod 1 is an infinite rank polynomial of degree 1. Because 
of the failure of Theorem 14.11 at the endpoint k = p, we will need to require certain 
polynomials to have degree at least two in our arguments; but we will be able to eliminate 
all linear polynomials from our analysis by exploiting the freedom to pass to finite index 
subspaces. 



6. Multilinear concatenation 

We now begin the proof of Theorem 14. 6[ The strategy will be to obtain enough control 
on the biased form T G CSM'^'^^(l^) that one can explicitly write this form as d^^^P on 
a bounded index subspace V for some bounded rank polynomial P G Poly^;^.,.! br(^')- 

To get some intuition as to what expressions such as d'^^^P look like, consider the case 
when s + 1 = 4, V^' = V^, and P takes the form P = QR for some quadratic polynomials 
Q,R ^ Polj^2BR(y ~^ Clearly, P is of bounded rank. A brief computation using 
the discrete Leibniz rule (lD.2p then reveals that 

AaA,A,A4QR) = {AaA,Q){A,AdR) + {AaA,Q){AhAdR) + (A,A,Q)(A,A,i?) 

+ (A,A,g)(A,A,/?) + (A,Arfg)(A„A,i?) + (A,ArfQ)(A,A,i?) 

for any a,b,c, dev. Thus, if we let B,C e CSM^(\/ F) be the quadratic forms 
B := d'^Q, C := d'^R, then we have 

d^P(a, 6, c, d) = B{a, b)C{c, d) + S(a, c)C(6, d) + 5(a, d)C{b, c) 
+ B{b, c)C{a, d) + Bib, d)C{a, c) + B{c, d)C{a, b). 

By Lemma [4. 5 1 we thus see that any quadrilinear form T G CSM'^{V) of the form 

T(a, 6, c, d) = B{a, b)C{c, d) + S(a, c)C(6, d) + 5(a, d)C{b, c) 
+ Bib, c)C(a, d) + Bib, d)C{a, c) + B{c, d)C{a, b) 

for some B,C E CSM^(V^) will be of the desired form d*P for Theorem 14. 6[ 

To generalise this discussion to higher dimesions, we introduce the following operation. 

Definition 6.1 (Concatenation). Let S G CSM'=(\/) and T G CSM'(I^) for some stan- 
dard integers k,l^l. We define the concatenation S*T e CSM^^\V) of S and T by 
the formula 

{l,...,k+l}=AiSB 
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where the sum ranges over all partitions of {1, . . . ,k + 1} into a /c-element set A and 
an Z-element set B, and we define S'((/;,j)jg^) by enumerating A arbitrarily (the precise 
ordering is not relevant due to the symmetry of S), and similarly for T{{hj)j^B)- Thus, 
for instance, if ^, T G CSM^iV), then 

{S * T){a, 6, c, d) := S{a, b)T{c, d) + S{a, c)T{b, d) + S{a, d)T{b, c) 

+ S{b, c)T{a, d) + S{b, d)T{a, c) + S{c, d)T{a, b). 



It is not hard to see that S* * T is indeed multilinear and symmetric. The fact that it 
is classical is also easily seen after observing that all the binomial coefficients with 
1 ^ J < P are divisible by p and thus vanish on F. The operation * is also easily 
seen to be bilinear, commutative, and associative. As the previous discussion already 
indicated, this operation is closely related to multiplication on (classical) polynomials. 
More precisely, we have: 

Lemma 6.2 (Product rule). Let k,l ^ 1 be standard integers. If P E Poly<^(V" F) 
and Q G Poly^;(V ^ F), then d^+\PQ) = {d^P) * [SQ). 



Proof. We apply the discrete Leibniz rule ( ]D.2|) repeatedly to expand out the derivative 

A,, . . . A,,^X^Q) (6.1) 

with hi, ... , hk+i £ y ■ Note that if P accepts more than k derivatives, or Q accepts 
more than / derivatives, then the resulting term in the expansion of (16. ip vanishes. 
Thus the only terms in (16. ip that survive are those in which P is differentiated exactly 
k times, and Q differentiated exactly I times. Collecting all such terms one obtains the 
claim. □ 



From this lemma and Lemma [4.5[ we see that any form T G CSM''^^(K) which can be 
expressed (possibly after passing to a bounded index subspace) as a linear combination 
of concatenations Ti * . . . * of classical symmetric multilinear forms of degree strictly 
less than s + 1, will satisfy the conclusions of Theorem 14.61 

In the high characteristic case p > s + 1, it turns out that these concatenations are 
the only expressions one needs to consider to establish Theorem 14.61 Unfortunately the 
situation is more complicated in the low characteristic case p ^ s + 1. This can be 
illustrated by using the symmetric polynomial 5*4 G Poly^4(F2 — ?■ F2) from Example 
11.161 (this is on a finite-dimensional space rather than a limit finite-dimensional space, 
but let us ignore this technicality for this discussion). A routine calculation reveals that 
the quartilinear form d^S^ G CSM^(F2) can be expressed as 

d^Si{a, b, c, d) = B{a, b)B{c, d) + 5(a, c)5(6, d) + 5(a, d)B{b, c) (6.2) 

for all a, 6, c, G F^, where B G CSM2(F^) is the bilinear form B = d^S2, thus 

B{a, b) := aibj. 

(The identity (16. 2p . which was already observed in [53], [22], can be established by test- 
ing it on generators a, b,c,d E {ci, . . . , e„}.) The right-hand side of (16. 2 p is formally of 
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the form {B * B)/2\, but the operation of dividing by 2! is not well-defined in charac- 
teristic two, and so in fact one cannot easily express fl6.2p in terms of the concatenation 
operation. Instead, we have to introduce a new operation to handle expressions of this 
form: 

Definition 6.3 (Symmetric power). Let T G CSM''(V) for some standard integers 
k ^ 2 and m ^ 1. We define the symmetric power Sym™(T) G CSM™'^(V^) by the 
formula 

Sym'"(T)(/ii, . . . , h^k) = J2Il nihiheA), 

A A€A 

where the sum ranges over all partitions .4 of {1, ... , mk} into m subsets A of cardinality 
k each. For instance, ii m = k = 2, then 

Sym2(T)(a, 6, c, d) = T{a, b)T{c, d) + T(a, c)T(6, d) + T(a, d)T{b, c). 

Again, it is clear that Sym^(T) is symmetric and multilinear; the fact that it is classical 
follows by observing that when p of the arguments of Sym'^(T) are set to be equal, then 
the multiplicity of each term is a multiple of p (as it is equal to p\ divided by a number of 
factorials that are strictly less than p!; note here we use the hypothesis k ^ 2). Because 
of our need to avoid the k = 1 case, we will have to take some care to eliminate all 
linear forms from the arguments in the next section, by using the trick of passing to a 
finite index subspace to make these forms vanish. 

Observe that 

m!Sym'^(T) =T*...*T 

where the right-hand side contains m copies of T. Thus, in high characteristic p > m, 
one can write the symmetric power in terms of the concatenation operation by the 
formula 

T * *T 

Sym-(T) = . (6.3) 

ml 

However, in the low characteristic case the symmetric power operation cannot be re- 
duced easily to the concatenation operation, and we need to consider the two operations 
separately. 

There is an analogue of Lemma 16.21 

Lemma 6.4 (Symmetric power rule). Let k ^ 2 and m ^ 1 be standard integers, 
and let T G CSM'^(I^) be a classical symmetric multilinear form. Then there exists 
Q G Poly^^fc(\/ ^ F) such d"'^Q = Sym™(T). Furthermore, if m ^ 2, then Q has 
bounded rank. 

Proof. By Lemma [4.51 we may write T = d^P for some P G Poly^^(l^ — > F). 

By a limiting argument it suffices to establish the claim when V is finite dimensional, as 
long as the bound in "bounded rank" depends only on p, k, m and not on the dimension 
of V. 
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Heuristically, in view of f l6.3p and Lemma 16.2^ it is natural to try to set Q equal to 
P"^/m\. This works in the high characteristic case m < p, but not in the low character- 
istic case p ^ m due to the non-invertibility of ml. To get around this, we will use the 
binomial coefficient (^) instead of P"^/m\; but this requires lifting P to a larger group 
than F = Z/pZ. 

We turn to the details. We let M ^ be the first integer such that m < p'^^'^^, 
thus ^ m. Using Lemma ILTf y) repeatedly, we may find a polynomial Pm € 
Poly ^i.^]^^p_i^(y —7- T) such that p^^ Pm = '•(-P)- In particular. Pa/ takes values in the 
^pM+iyh j^QQ^g Qf unity. We may thus pull Pm back to the cyclic group Z/p^^^Z to 
obtain a polynomial Pm G Poly^;,_,_j,j(p_;^)(V — )■ Z/p'^^^^Z) such that Pm = P mod p. 

An inspection of the formula (") = fQ^- binomial coefficient reveals 

\rn/ m(m—l)...l 

that the map n i— i- mod p is periodic with period whenever m < p^^^. In 

particular, by abuse of notation we may define the binomial coefficient (^) mod p G F 
whenever m < and n G Zjp^^^Z. We then set Q := i^^^ mod p. 

We first verify that Q is a polynomial of degree ^ mk. For inductive reasons, we will 
prove the more general claim that for any j ^ 0, any hi, . . . ,hj E V, and any ^ m' < 
pM+i^ the expression ( ''^ ^,''^^") has degree at most k ~ j + {m' — 1) max(A; — j, 1). 
Clearly, this implies the previous claim by setting j := and m' := m. 

We first address the degenerate case when k —j + (m' — 1) max(A; — j, 1) is negative, so in 
particular m' ^ j — k. The polynomial A/^^ . . . A^^Pm has degree ^ A; + M{p — 1) — j, so 
by Lemma ri.7( v). it is divisible by p""*"^ whenever ^ a ^ M and k + a{p — 1) — j <Q. 

In particular, A/^^ . . . Ah^PM is divisible by p^ p-'^ . On the other hand, observe that 

(^,) is divisible by p whenever n is divisible by and m' < p". Since m' < p^ p-^ 
we obtain the claim. 



To handle the non-degenerate cases when k — j + {m! — 1) max(fc — j, 1) ^ 0, we use 
downward induction on j. The claim is vacuously true for j sufficiently large, so we 
assume inductively that the claim is proven for all larger values of j; for fixed j, we also 
assume inductively that the claim is proven for all smaller values of m' . By Lemma 
ll.7r i). it suffices to show that the expression 

has degree at most k — j + (m' — 1) max(/i; — j, 1) — 1 for all hj^i G V . 
From the combinatorial identity 

fn + n' 
\ m' 
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we see that 

-<:M:rr)C/_.) 

whenever h e V and F : V ^ Z/p^^+^Z. We may therefore expand (16.41) as 



E 

i=l 



Ah, . . . Ah^.^PM\ [Ah,...Ah^Pi 



M 



m' — i 



In each summand, we apply the two induction hypotheses to conclude that the first 
factor in the summand has degree ^ k — j~l + {i — l) max{k—j — 1,1), and the second 
factor has degree ^ k — j + {m' — i — 1) max(A; — j, 1). A routine computation shows 
that 

{k—j — l+{i—l) max{k—j—l, l)) + {k—j+{m'—i—l) max(A;— j, 1)) ^ k—j+{m'—l) max{k—j, 1) — 1 

whenever i ^ 1 (treating the cases k — j > 1, k — j ^ 1 separately), and the claim then 
follows from Lemma [1.7( iv). Thus Q has degree ^ mk as desired. 

Now we compute the derivative d^^'^Q of Q = [^^) . Using (16. 5 p we have 

^ / Pm\ _ f AftP/vf \ / Pm 

for any h . By the above computations, the polynomial ('^''f*^) (^-j) degree 

^ {{k - 1) + (z - l){k -l)) + {k + {m-i- l)k) = mk - i. 

In particular, all the terms with i > 1 have degree strictly less than mk — 1 and thus 
will not contribute to d"^^(^^y The i = 1 term can be simplified as A/iP(^^^-^). We 
conclude that 

^mk fPM\ ^^^^ _ _ _ ^ ^^^^ ^ d^'-\{An,^,P) ( hrnk-l). 

\m J \m — IJ 

Expanding this out using Lemma 16.21 we have 



J2 d'P{h,„...,hi,_„hrnk)d'^"'-''>'( ^'O 



{hj,,...,h 



where 1 ^ ji < . . . < j{m-i)k < ^k are the ordered enumeration of the set {1, . . . , mk — 
l}\{ii, . . . The claim d"^^{^^^ = Sym™((i'^P) then follows by induction on m. 

□ 

Example 6.5. We illustrate the above lemma with p = m = k = 2, with = F" and 
P = S2 the symmetric polynomial from Example I1.16[ thus 5*2 : — F is the classical 
quadratic polynomial 

S2{xi, . . . , Xn) = ^ ^ XiXj. 
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The bilinear form B := d"^ S2 '■ ¥ is then given as 

B{{xi,...,Xn),{yi,...,yn)) = 

this is a classical symmetric bilinear form. 

Lemma 16.41 asserts the existence of a classical quartic Q : V ^ ¥ with 

(i^g(a, b, c, d) = B{a, b)B{c, d) + 5(a, c)B{h, d) + 5(a, c). (6.6) 

By (16.21) we see that we can take Q = 6*4. To connect this with the proof of Lemma [6. 4[ 
we recall that P = S2 = L mod 2, so if we set Pi := L mod 4, then P = Pi mod 2, and 
= (^1) mod 2. 

Theorem 14.61 now follows from a more explicit claim: 

Theorem 6.6 (Explicit inverse Gowers for classical symmetric multilinear forms). Let 

s ^ 2 be such that GIP(s') is true for all ^ s' < s, let V be limit finite- dimensional, 
and let T e CSM'^\V) be such that \Eh,,...,h,+,eveiLiT{hi, . . . , hs+i)))\ > 1. Then there 
exists a bounded index subspace V ofV such that on {V')'^^^, T is a linear combination 
( over ¥) of a bounded number of expressions of the form 

Sym"^i(5i)*...*Sym"'-(S,) (6.7) 

for some mi, . . . , ^ 1 and 2 ^ ki, . . . , kr < s + 1 and Si G CSM'^'(y) for i = 1, . . . , r 

with 

rriiki + . . . + rrirkr = s + 1. 

Indeed, by repeatedly applying Lemma 14.51 we find that 5*^ = d^'Pi for some Pi G 
Poly<fc^(y ^ F). By Lemma [621 Sym'^'(d'='Pi) = d''^"'^Qi for some Qi G Polj^^^j^^{V 
F), and Qi is of bounded rank if rrii > 2. Now by Lemma [6l2| we see that any expression 
of the form (16. 7p can be expressed as d'^'^^Q on V where Q = YliQi ^ Poly^s+i br(^' ~^ 
F) (note that the product of at least two polynomials of degree > 1 is necessarily of 
bounded rank), and Theorem 14.61 follows by linearity. 

It remains to establish Theorem 16.61 To illustrate the type of result one is seeking here, 
in the case s + 1 = 6, one has a classical symmetric sextilinear form T G CSM^(\^) 
which is biased in the sense that 

|Ea,b,c,d,e,/gye(i(T(a, 6, c, rf, e, /)))! > 1, 

and one wishes to conclude that on a bounded index subspace V of V, T{a, b, c, d, e, /) 
can be decomposed into a bounded number of pieces such as the expression 

Bia,b)B{c,d)B{eJ) 

plus 2!2^!3T — 1 = 14 Other permutations (for some B G CSM^(V)), adding up to 
Sym=^(P), or 

B{a,b)B{c,d)B'ie,f) 

plus 2!^!2! — 1 = 44 other permutations (for some B,B' G CSM^(V^')), adding up to 
Sjm\B) * B', or 

C{a,b,c)C{d,e,f) 
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plus g|||2| — 1 = 9 other permutations (for some C G CSM^(V^')), adding up to Sym^(C), 
or 

B{a,h)D{c,d,eJ) 

plus ^ - 1 = 14 other permutations (for some B G CSM^(r) and D G C^M\V')), 
adding up to S * Z^. 



7. EQUIDISTRIBUTION of MULTILINEAR MAPS 



In order to establish Theorem 16.61 (and thus Theorem 14.61) we will need an equidistri- 
bution theory for classical symmetric multilinear maps, analogous to that in [22], |28j . 
We introduce some definitions: 

Definition 7.1 (Bounded rank for multilinear forms). Let k ^ 1 and let V be limit 
finite-dimensional. A form T G CSM'^(I^) is said to be hounded rank if there exist a 
bounded number of forms Si G CSM'^'(V^), i = 1, . . . , m for some 1 ^ h < k such that 
for hi, . . . ,hk E V, the expression T{hi, . . . , hk) is a function of expressions of the form 
Si{hj-^, . . . ,hji^ ) for some 1 ^ i ^ m and 1 ^ ji < . . . < ^ k, and unbounded rank 

otherwise. The space of bounded rank forms will be denoted CSMgj^(\^); it is clearly a 
subspace of CSM^V). 

Thus, for instance, if k = 4 and T takes the form 

T(a, b, c, d) = B{a, b)B{c, d) + 5(a, c)5(6, d) + 5(a, d)B{b, c) 
(i.e. T = Sym^(S)) for some B G CSM^(V^), then T would be bounded rank. 

Our starting point will be the following result of Kaufman and Lovett [2B] (which in 
turn is based on the earlier paper [22]), which links bounded rank with bias: 

Proposition 7.2 (Bias criterion). Let k ^ 1 be a standard integer, and let V be limit 
finite- dimensional. Then a form T G CSM'^(V) is bounded rank if and only if 

\&h„...M^ve{i{T{hi,...,hk)))\:^l. 

Proof. The "only if" part is easy: if T is bounded rank, then by Fourier analysis, 
the function e{i{T{hi, . . . , hk))) is a bounded linear combination of functions of the 
form e(L(Y^^iCiSi{hj-^, . . . , hj^_))) for q G F. Such functions can be factorised as 

Y['j=i fji^iy • • • ! hk) where the fj are bounded functions not depending on hj. On the 
other hand, we clearly have 

Ehi,...,hfceye(t(T(/ii, . . . , hk)))e{L{T{hi, hk))) = 1. 

Applying the pigeonhole principle we conclude that 

d 

|E^i,...,/.,eye(6(T(/ii, . . . , hk))) l[fjih,,...,hk)\ > 1 

j=i 

for at least one collection fi, ■ ■ ■ , fd of bounded functions; the claim then follows from 
Lemma [1.1 7( iv) (and Lemma [4. 5p . 
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We now turn to the "if" part. This result follows easily from Theorem 2]. Indeed, 
applying that theorem, we see that T{hi, . . . , h^) is a function of a bounded number 
of polynomials A^T{hi, . . . , h^) for v G V'^, but by the multilinearity of T we see that 
all such derivatives are functions of expressions of the form Si{hj^, . . . , /i^^, ) for some 
1 ^ ji < . . . < jfci ^ k, as required. □ 



Using the above proposition, we can obtain a criterion for joint equidistribution for 
certain systems of CSM forms; the precise definition of joint equidistribution is given in 
Definition IC.20I We need another definition: 

Definition 7.3 (Systems of CSM forms). Let fco ^ 1 be standard, and let V be limit 
finite-dimensional. A CSM-system T = {Tk^i)i<^k<ko;i^i^m,, of degree < fco is a collection 
of forms Tk^i G CSM^iV) for 1 ^ k < k^ and 1 ^ i ^ mk, where mi, . . . ,mk-i are 
standard natural numbers. A CSM-system is said to be regular if, for each 1 ^ k < ko, 
the forms i, . . . , Tk^m^, ^ CSM^{V) are linearly independent modulo CSM|j^(\^), thus 
one has 

a,Tk,i + ... + akTk,m, ^ CSM^j,(l^) 
whenever ai, . . . , G F are not all zero. 

Let S G CSM'^(l^) be a classical symmetric multilinear form of some (standard) degree 
1. We say that S is measurable with respect to a CSM-system T = {Tk,i)i^k<ko-i^i^mk 
if there is a functional relationship of the form 

S = F((T„),eA) 

where A is the set of tuples 

CK = {ka, iai ia,li ■ ■ ■ i ja,ka) 

with 1 ^ ka < ko, 1 ^ ia ^ f^k^i and 1 ^ ia,i < ja,2 < . . . < ja,ka ^ d (in particular, 
this forces ka ^ d), and F : — )■ F is a function. 

Example 7.4. A collection Li, . . . , : V — )■ F of linear forms, together with a 
collection Bi, . . . , : V^^ — )• F of classical symmetric bilinear forms, will form a CSM- 
system of degree < 3. In order for this system to be regular, the linear forms Li, . . . , 
must be linearly independent, and no non-trivial linear combination of the . . . , 
can be of bounded rank (i.e. expressible in terms of boundedly many linear forms). A 
quartilinear form such as Bi * B2, Bi * Li * L2, Bi * Sym^(Li), Sym^(Si), or linear 
combinations thereof, will be measurable with respect to T (provided that mi,m2 are 
large enough so that these expressions make sense, of course). 

Proposition 17.21 can now be recast as follows: 

Corollary 7.5 (Bias criterion, again). Let k ^ 1 be a standard integer, and let V be 
limit finite- dimensional. Let T G CSM'^(I^). Then one has 

\Kh„...,h,^ve{i{T{hi,...,hk)))\ > 1 

if and only if T is measurable with respect to a CSM-factor S = {Sk' ^i)i(^k'<k;i!ii!imf., of 
degree < k. 
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The CSM-factor S given by the above corollary is not necessarily regular, but we may 
always regularise it as follows: 

Lemma 7.6 (Regularity lemma). Let ko ^ 1 he a standard integer, and let T = 
{Tk,i)i!ik<ko;i!ii!imk ^e a CSM-factor of degree < ko. Then there exists a regular CSM- 
factor S = {Sk,i)i^k<ko;i^i^m'^ of degree < ko, such that every multilinear form Tk^i in T 
is measurable with respect to S. 

Proof. We assume inductively that the claim has already been proven for all smaller 
values of fco (this hypothesis is vacuous for kQ = 1). 

We consider the top-order forms Tfcg.i^i, . . . , Tfco_i^mj,^_^ G CSM'^''^"'^(V^), projected to 
the quotient space every finitely generated vector space 

has a finite basis, we may thus find forms Sk^-i i, • • • , ^fco-i m' e CSM''«"^(V) that 

are linearly independent modulo CSMgJ^^(y), thus that each Tkg-i^i{hi, . . . ,hk) is a 
linear combination (over F) of the Sko-i,i{hi, . . . , hk), ■ ■ ■ , SkQ-i m' {hi, ... , hk), plus a 
bounded rank form of /ii, . . . , hk, which by definition of bounded rank can be expressed 
as a function of a bounded family of forms U{hi^, . . . ,hi^,) with 1 ^ k' < k, U & 

Cm^'{V), audi < ... <ik' ^ k'. We may add all such forms U to the list of 
lower order forms Tk'^i, . . . ,Tki^my Applying the induction hypothesis to those lower 
order forms we then obtain the claim. □ 

Applying this regularity lemma to Corollary 17.51 we obtain the following improvement: 

Corollary 7.7 (Regularised bias criterion). Let k ^ 1 be a standard integer, and let V 
be limit finite- dimensional. Let T G CSM'^(V^). Then one has 

\Eh,_h,eve{L{T{hu...,hk)))\ > 1 

if and only ifT is measurable with respect to a regular CSM-factor S = {Sk',i)i-^k'<k;i^i^m, 
of degree < k . 

To use this criterion, we need the following counting lemma which complements the 
regularity lemma. 

Lemma 7.8 (Counting lemma). Let ko ^ 1 be a standard integer, and let T = 
{Tk,i)i!^k<ko;isii!imk bc a regular CSM-system. Then for any 1, the expressions 

Tk,i,n,...,jk '■ {hi, ... ,hd) ^ Tk,i{hj^, . . ., hjj, 

where l^k<kQ, l^i^ ruk, and 1 ^ ji < . . . < jk ^ d, as functions from to ¥, 
are jointly equidistributed (as defined in Definition \C.2(^) . 

Proof. By the Weyl equidistribution criterion (Lemma IC.23p . it suffices to show that 
Ehi,...,hdeye(6( ^ ^ ^ Ck,i^j^,...,j^Tk,i{hj^,...,hj^))=o{l) (7.1) 

l^fc<fco i=l l^ji<...<jfes£d 

whenever Ck^ij^^...j^. G F are not all zero. 
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Let 1 ^ ki < ko he the largest k for which there is a non-zero coefficient Ck^ijj^^...j^- By 
relabehng we may assume that it is Cfe^_i^i_,,,^fe^ which is non-zero. We may then factorise 
the left-hand side of (17. ip as 

fci 

Ehi,...,h^gye(6(T,(/ii, . . . , hkj)) Yl fii.hu • • • , M 

i=l 

where /j : V — *C are limit functions bounded in magnitude by 1 which are independent 
of the hi variable, and T* : V^"^ — )■ F is the multilinear form 

mi 

T^,{hi, . . . ,hkj := Ckj^,i,i,-,ki'^ki,i{hi, . . . , /i^J. 
1=1 

By hypothesis, T^, has unbounded rank. By Proposition 17. 2[ we conclude that 

Eh^,...,haGve{L{T^{hi, . . . , hk,))) = o(l). 
Using Lemma 11.171 (and Lemma 14. 5p we conclude that 

Ehi,...,h^eFe(t(T4/ii, . . . , /ifci)) JJ fiih, . . . , ha)) = o(l) 

i=l 

and the claim follows. □ 

Remark 7.9. The counting lemma is essentially asserting that a regular CSM-factor is 
equidistributed on cubes in the sense of Definition lC.2H but to formalise this rigorously, 
one needs to develop a theory of polynomial maps in several variables, that generalises 
the material in Section [C] to groups filtered by N'^ rather than N. This can be done (see 
[21]), but we will not introduce this additional notation here. 

In the next section we will use the counting and regularity lemmas to finish off the proof 
of Theorem 16.61 and thus Theorem 14.61 



8. Conclusion of the multilinear inverse conjecture 



We are now ready to complete the proof of Theorem 16.61 Let s,T,V be as in that 
theorem. Applying Corollary 17. 7[ we conclude that we can write 

T = F{{Sa)aeA), 

where (5'fc_j)i^fe^s;i^j^mfe is a regular CSM-system, A is the set of tuples 

Oi {koi, iai ia,li • • • 1 ja,ka) 

with 1 ^ ka ^ s, 1 ^ ia ^ riik^, and 1 ^ < ja,2 < ... < ja,k^ ^ s + 1, F : F 
is a function, and for each a E A, Sa & CSM^"(y) is the form 

Sa{hi, . . . , hkj := Sk^^i^{hj^ -^, . . . , hj^ i^J. 



The linear forms 5*1^1, . . . , Si^mi in the regular CSM-system can be eliminated by ob- 
serving that they are simultaneously constant on some bounded index subspace of V, 
and so by passing to that subspace (using Lemma 14. 7p we may assume that mi = 0, 
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i.e. all forms Sk,i in the CSM-system are bilinear or higher in order. Of course, the 
GSM- system remains regular after doing so. 

For each Sa, we refer to the set Jq := {ja,i, ■ ■ ■ ,ja,ka} C {1, . . . , s} as the support of a. 

The permutation group Sym({l, . . . ,s + 1}) acts on the variables hi, ... , hg+i, and thus 
permutes the index set A. Because T is symmetric (and {Sa)aeA : — is surjective, 
by Lemma [7. Sp . we see that F is symmetric with respect to this action. 

Next, we show that F is also multilinear: 

Proposition 8.1. F : ^ F is a linear combination ( over ¥) of monomials 

iXa)a€A ^ Xa-^ . . . Xq,^ (8-1) 

where ai, are elements of A whose supports Jq,^, . . . , Ja^ partition {1, . . . , s + 1}. 

For instance, if s + 1 = 4, this proposition asserts that T(a, b, c, d) is a linear combination 
of expressions such as 

B{a,b)B'{c, d) 

for bilinear forms B,B' G CSM^(y) in the CSM-system, but not expressions such as 

B{a,b)B\a,c) 

or 

B{a,bfB{c, d). 

Proof. Split A = A^s+i U A^s+i, where A^s+i consist of those a E A whose support Jq, 
contains s + 1, and A^^+i := A\A^s^i. We split F^ = F^^^+i x F^^«+i in the obvious 
manner. We claim the linearity statement 

F{x=,s+i + y^s+i, x^s+i) = F{x=,s+i, x^s+i) + F{y=,s+u x^s+i) (8.2) 
whenever x^s+i,y=)s+i £ F^^^+i and x^^+i G F^^^+i. 

To prove (18. 2p . it suffices by the linearity of 

T = F{{Sa)aeA3s+iJ {Sa)a€A^^+i) 

in the /i^+i variable to locate hi, . . . ,hs, /i^+i, /i'^+i G V such that 

^Bs+iihi, . . . , hg, hg^i) = x^s+i 

SBs+lihi, . . . ,hs, /i's+i) = Z/gs+l 
'S'^s+l(^l; ■ ■ ■ ,hs, hs+i) = X^s + l 

since this implies that 

S3s+l{hl, . . . ,hs, hs+i + h'g^^) = x^s+i + y3s+i 

and 

'S'^s+i(^i, ■ ■ ■ ,hs, h'g^]) = x^s+i- 
But the existence oihi, . . . , hg, /i^+i, h'^^^ G V with these properties follows immediately 
from Lemma 17. 8[ 
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By symmetry, fl8.2p generalises to 

F{x^j + y^j,x^j) = F{x^j, x^j) + F{y^j, x^j) (8.3) 

whenever 1 ^ j ^ s + 1, x^j,y^j G F^^^ and x^j G F^^^, where A^j,A^j are defined 
analogously to A^g+i, ^^s+i by replacing s + 1 with j, and F^ is identified with F'^^j x 
in the obvious manner. We claim that the identities fl8.3p imply that F is a linear 
combination of the monomials (18.11) (note that the converse claim is clear) . 

To establish this implication, we induct on s. The case s = is easily verified, so suppose 
s ^ 1 and the claim has already been proven for smaller s. For every a G A^s+i, 
we consider the derivative Ag^-F : F"^ — )■ F of F in the basis direction Cq. From 
(18. Sp (applied to each j in the support of a) we see that Ae^F{x) is independent of 
any coefficient whose support Jg intersects J^, and thus descends to a function 
Fa on F^-^Q , where Aj± is the set of /3 G A whose support lies in {1, . . . , s + l}\Ja. 
The linearity properties (18. 3 p for j ^ descend from F to Fq,, so by the induction 
hypothesis (and relabeling) each F^ is a linear combination of monomials (18. ip with 
Jq,^, . . . , partitioning {1, . . . , k}\Ja- In particular, linear combination of 

monomials of the desired form. The function F — V ^ . a;„F„ is then invariant in 
the direction for all a G Ags+i and thus vanishes by (18. 3p (which implies in particular 
that F(0, x^s+i) = for all x^s+i G F^?-+i). The claim follows. □ 

Remark 8.2. Proposition 18.11 can be viewed as a variant of Corollary IC.22[ but to 
make this connection precise one would have to generalise the machinery in Appendix 
Oto N'^-filtered groups, as in [24j . 



From Proposition 18.11 we have 

•^o)a£A) ^ ^ ^{ai,...,ar}'^ai ■ ■ --^ar 

{ai,...,Or} 

for some coefficients C{ai,...,ar}y where {ai, . . . , a^} ranges over all unordered collections 
of elements ai, . . . , of A whose supports partition {l,...,s + l}. The multilinear 
forms Xai ■ ■ -Xar are clearly linearly independent, and so the coefficients C{ai,...,ar} are 
uniquely determined by F. In particular, since F is symmetric with respect to the 
permutation action of Sym({l, . . . , s}), the coefficients C{ai,...,ar} must be symmetric 
also. Substituting := 5^, we now write 

T = ^ ] C-^aj^^ a^ySa^ . . . Sa,.- 

{ai,...,Or} 

From the symmetry of the coefficients C[ai,...,ar}j we may split this sum into orbits of the 
action of the permutation group, and conclude that T is in fact a linear combination of 
the basic symmetric monomials 

Sym"^H'^fci,n)*---*Sym'"K'5fc„0 

with rriiki + . . . + rriiki = s + 1. Note that all the ki, . . . ,ki are at least 2, because 
we have deleted all the linear forms from the CSM-system. This gives Theorem 16.61 as 
required. 
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9. A REGULARITY LEMMA AND EQUIDISTRIBUTION FOR NON-CLASSICAL 

POLYNOMIALS 

It remains to establish Theorem 14.21 To do this, it is convenient for inductive reasons 
to estabhsh a technical strengthening of Theorem 14.21 We first need an analogue of the 
notion of a regular CSM-system, but now for polynomials instead of multilinear forms: 

Definition 9.1 (Regular factor). Let be a limit finite-dimensional vector space. A 
factor is a bounded family P = {Pij)i<^i<^m;o^j^j, of polynomials Pij e Poly^£,^.+j(p„i)(F -)■ 
*T), where m ^ and Ji, . . . , Jm ^ and Di, . . . , Dm ^ 2 are standard natural num- 
bers, obeying the relations 

pPij = P,,,_i (9.1) 

for all 1 ^ z ^ m and ^ j ^ Jj (with the convention -Pi,-i = 0). In particular, each 
Pi J takes values in the (^p>+^y^ roots of unity ^j+tIj/'Z. We refer to m as the dimension 
of the factor, Di, . . . , Dm as the initial degrees, and Ji, . . . , as the depths. The degree 
of the factor is the quantity sup]^<;j^^ D^ + Ji{p — 1) . 

A depth extension of P is a factor of the form P' = (Pij)i<gi<gm;Os;i^j;; where J- ^ J, 
for each 1 ^ i ^ m, and the polynomials Pi j in P' agree with their counterparts in 
P for j ^ Jj. If P' is a depth extension of P, we call P a depth retraction of P'. If 
d is a. standard integer, we define the degree ^ d depth retraction P^^ of P to be the 
retraction formed by deleting all Pij with Di + Jj(p — 1) > d. 

If, for every standard integer k ^ 2, the polynomials Pij with Di + j{p — 1) = k are 
linearly independent in Polj^j^iV — )■ *T) modulo Poly^^Bj^(V — )■ *T), we say that the 
factor P is regular. 

A function Q : V —> *T is said to be measurable with respect to the factor P if one has 
Q = F{Pij^, Pm,jJ for some function F : ^ *T. 

When working exclusively with classical polynomials, one can set all the depths Ji to 
zero, and the notion of a factor and a regular factor then become essentially the same 
as those considered in [22] (see also [28] for a variant of the notion of regularity in the 
low characteristic case). However, when trying to regularise non-classical polynomials, 
one unfortunately needs to consider factors of positive depth, which are more technical 
to study. 

Note that in our definition of a regular factor, the degrees Di, . . . , D^ are at least two. 
This is because we can eliminate any linear polynomials that arise in the analysis by 
passing to a finite index subspace. We will need to eliminate the linear case in order to 
avoid the k = p case of Corollarv 15.21 which is false. 

Example 9.2. Suppose one has three polynomials Pi^ G Po\j^^_^(y — )■ ^Z/Z), Pi i G 
Poly<j£,^+p_i(V ^Z/Z), and Pa.o e Fo\y^^^^p_^{V ^Z/Z) for some natural num- 
bers Di,D2 ^ 2, with pPi,i = Pifl, then P = (Pi.o, -^1,1, A.o) would be a factor of 
dimension 2, initial degrees Di,D2, depths 1,0, and degree max(Di + p — l,D2). One 
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can view this factor as a map x t— ?■ {Pi^i{x), P2fl{x)) from V from (^Z/Z) x (^Z/Z) 
(the polynomial Pi^ can be omitted from this map as it is determined by -Pi,i). 

From Lemma ll.7r iv). one can find roots Pi, 2 G Poly^£i^_,_2(p-i)(^ ^ ^^/^) ^^id 
P2,i G Poly^^2+(p_i)(\/ ^ ^^/^) of Pi,i and P2,o respectively (thus pPi,2 = Pi,i 
and pP2,i = Pj.o); then P' = (Pi,o, Pi,i, Pi, 2, A.O; -^2,1) is a depth extension of the factor 
P = (Pi,o, Pi,i, P2,o) (or equivalently, P is a depth retraction of P'), in which the depths 
have been increased from Ji = 1, J2 = to J( = 2, = 1- Of course, one can iterate 
this procedure and perform depth extensions of P to arbitrary depths. If P is inter- 
preted as a map from to (^Z/Z) x (^Z/Z), one can view P' as a lift of that map to 

i^Z/Z) X i^Z/Z), with the original map factoring through the map (x, y) ^ {px, py) 
from (^Z/Z) X (^Z/Z) to (^Z/Z) x (iZ/Z). 

For sake of concreteness, let us now suppose that D2 = Di + p — 1. Then P is regular 
precisely when the degree Di polynomial Pi has unbounded rank, and all non-trivial 
linear combinations of the degree D2 = Di+p — 1 polynomials Pi^i, P2,o have unbounded 
rank. In order for P' to be regular, one must also add the additional property that no 
non-trivial linear combination of the degree D2 + p — I = Di + 2{p — 1) polynomials 
Pi 2, P2 1 have unbounded rank. In this case, P is the degree ^ D2 depth retraction of 
P'. 

A convenient property of depth extensions is that they preserve regularity: 

Lemma 9.3 (Depth extensions preserve regularity). Let P = (Pi,j)i^i^m;o^jXJi ^ 
regular factor, and let P' = (P/j)i^j^m';Os£j^j( be a depth extension of P. Then P' is 
also regular. 

Proof. By induction, it suffices to verify the claim in the case when ^ Ji + 1. 

Suppose for contradiction that P' is not regular, then there exists k ^ such that 
the polynomials P-j with Di + j{p — 1) = k have a non-trivial linear dependence in 
Poly^fc(V -4 *T) modulo Poly^^^BRl^ ^ thus 

J2 c,,Pl^ e Poly^,,BR(^ ^ *T) 

for some coefficients Cjj G Z, not all zero. 

Suppose that Qj vanished whenever j = Jj + 1, then this linear dependence already 
occured in P, implying that P was also not regular, a contradiction. Thus we have 
Ci,Ji+i 7^ for at least one i; since ^ 2, this also forces k > p. (It is here that we 
crucially need to prevent from being equal to 1.) We now multiply the above linear 
dependence by p using Corollary 15.21 and f l9.ip to conclude that 

J2 CijPlj-i e Poly^fe_p+i,BR(V^ ^ 

l^j^m;Os£j^Ji;D,+j(p-l)=fc 

But this is again a non-trivial linear dependence in P, again yielding contradiction. □ 
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We now localise the exact roots property from Theorem 14.21 to depth extensions of a 
regular factor. More precisely, we introduce the following property: 

Definition 9.4 (Exact roots property). Let s be a standard integer. We say that the 
exact roots property ER(s) holds if, whenevever P = (-Pij)i<gi<gm;Os;jXJi is a regular factor 
on a limit finite-dimensional vector space V of some initial degrees Di, . . . , Dm, and Q G 
Poly^^_,_i(l^ —J- *T) is a function of P, and P' = {Pij)i<^i<^mfisij(iJ' is a depth extension of 
P with Di + J-{p - 1) > s for all 1 ^ i ^ m, then there exists R e Poly^^+p(\^' -)■ *T) 
that is a function of P' such that pR = Q on V^. Furthermore, R can be taken to be 
a linear combination of those Pi,j in P' with Di + j{p — 1) = s + p, plus a function of 
those Pi J with Di + j{p — 1) < s + p. 

Example 9.5. We continue Example 19.21 again supposing that D2 = Di + p — 1 for 
concreteness. Assume that P is regular. Let s < Di + 2{p — 1) be such that ER(s) 
holds, and suppose that we have a polynomial Q G Poly^^_,_]^(V *T) that is a function 
of the factor P, thus Q = ^2,0) for some F : (^Z/Z) x (^Z/Z) *T. Then we 

can find a polynomial R G Po\j^gj^p{V — > *T) that is a root of Q (thus pR = Q), which 
is a function of P' (thus R = F'(Pi,2, ^2,1) for some F' : (^Z/Z) x {^Z x Z) ^ *T). 
If s was equal to i^i + p — 2, then R would be a linear combination of Pi^2, -^2,1 plus a 
functin P"(Pi i,P2^o) of the lower degree polynomials Pi,2)-P2.i- 

One consequence of the exact roots property is that it allows for a regularity lemma: 

Lemma 9.6 (Regularity lemma). Let sq ^ 1 be a standard integer such that ER(s) 
and GIP(s) hold for all ^ s < sq. Suppose that P = (Pi,...,Pm) are a bounded 
tuple of polynomials Pi, ... , Pm G Poly^3jj(V — )■ *T) on a limit finite- dimensional vector 
space V . Then there exists a bounded index subspace V of V and a regular factor 
Q = (Qij)i^i^m;o^j^j, on V of dcgrcc at most Sq such that all the Pi are measurable 
with respect to Q on V' . Furthermore, on V' , if Pi has degree di, then Pi is a linear 
combination (over Z) of those polynomials Qij with D^ + j{p — 1) = di, plus a function 
of the depth retraction Q^di-i ofQ. 

Proof. We induct on sq. The claim is trivial for sq ^ 1 (note that any polynomial of 
degree 1 can be made constant by passing to a bounded index subspace), so suppose 
that Sq ^ 2 and that the claim has already been proven for smaller sq. Observe from 
Lemma [TT71 that pPolj^^^iV *T) C Poly^5p3R(\^ *T), thus the abehan group 
Poly^^p(y — 7- *T)/ Poly^^jj Bj^(y — t- *T) is in fact a vector space. This vector space 
contains Poly^g^(y — )■ ^(F))/ Poly^^^ bj^(\^ — )■ i(F)) as a subspace. 

As every finitely generated vector space has a finite basis, we may thus represent 
Pi,...,Pjn as a linear combination (over Z) of a bounded number of polynomials 
Pi, . . . , P^, G Poly^gjj(V^ — )■ *T) that are linearly independent modulo Poly^^^ br(^ ~^ 
*T) + Poly^gj|(V — )■ t(¥)), a bounded number of classical polynomials Pi, . . . ,Rm" ^ 
Poly^^p(V — )■ i(F)) that are linearly independent modulo Poly^^^ bj^(V^ — )■ i(F)), and a 
bounded number of bounded rank polynomials Si, ... , Sm'" G Poly^^^j br(^ *'^)- By 
definition of Poly^^^, br(V^ — )■ *T), the Si, ... , Sm'" are in turn functions of a bounded 
number of polynomials S[, . . . , S'^,,,, G Poly^^|j_i(\^ — )■ *T). 
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By Lemma \T77\ the polynomials pP[, . . . ,pP^i have degree at most max(so — p + I, 0), 
and in particular have degree ^ Sq — 1. Applying the induction hypothesis, and passing 
to a bounded index subspace V of V, the polynomials S[, . . . , S'^„„,pP[, . . . ,pP^^i, when 
restricted to V, are then all functions of a single regular factor Q = (Qjj )i<;i<;m""';o^is:Ji 
of degree ^ sq — 1 with initial degrees Di, . . . , Dm'"" ^ 2. 

Henceforth all polynomials will be understood to be restricted to V. Using Lemma H?7t 
we see that the various linear independence properties on P/, . . . , P^, and Ri, . . . , Rm" 
descend from V to V . 

By using Lemma ll.7( v) to perform depth extensions on Q as necessary, we may assume 
that 

So - p < A + Ji{p - 1) ^ So - 1 

for alH = 1, . . . , m; of course, this keeps the degree of Q to be ^ So — 1. Note that Q 
also remains regular, thanks to Lemma 19.31 

We now perform one further depth extension to Q to obtain Q' := {Qij)i^i^m""'-fi^j^Ji+i 
by choosing to be a polynomial in Poly^o^+(j^+i)(p_i)(\/' *T) obeying = 

Qj^j.; such a polynomial is available thanks to Lemma [1.7( v). This is clearly a factor. 
By Lemma 19. 3[ Q' is also regular. 

Note that when constructing Q', we have the freedom to modify each Qi.j^+i additively 
by a classical polynomial from Poly^^._,_(j.^i)(p_i)(V^' — i(F)); this freedom will be 
important later on. 

Now consider the polynomials pP[, . . . ,pP^,. By Lemma ll.7[ they have degree ^ 
max(so — p + 1,0), and are also functions of Q. Applying the hypothesis ER(so — p) 
(and refining V if necessary), we conclude that we can find polynomials Ui, . . . , Um' G 
Poly<;^j^(V^' — i- *T) that are functions of Q' and such that 



for all 1 ^ / ^ m'. In other words, P/ and Ui differ by a classical polynomial in 
Poly^^i^(y — )■ i(F)). (Note that these claims are trivial when so ^ p — l; the hypothesis 
ER(so— p) is only needed when so > p — !)• Furthermore, each Ui is a linear combination 
of those Qi^j.+i with Di + (Jj + l)(p — 1) = so, plus a function of Q'^g^^_i, thus 



for some coefficients q^, G Z, where Ei is a function of Q^so-i^ and A is the set of 
those 1 ^ z ^ m'"" with Di + (Jj + l){p — 1) = sq. We claim that the vectors q : = 
{ci^i mod p)i<zA G for Z = 1, . . . , m' are linearly independent. Indeed, suppose for 
contradiction that we had a non-trivial linear dependence 



in F^ for some ai, . . . , a^' G {0, ... ,p — 1}, not all zero. Then by (19.21) . the polynomial 



pp; = pUi 




(9.2) 



aiCi + . . . + am' Cm' 







aiUi + . . . + am'Um' 
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is a function of Qscso-i' ^^"^ '^^ degree ^ Sq. As the Ui differ from P/ by an ele- 
ment of Poly^3jj(y — )■ i(F)), we conclude that the P{, . . . ,P^, are linearly dependent 
modulo Poly^^i^ — > *T) + Poly^^|j(\^' — )■ l(¥)), contradicting the construction of 
P[, . . . , P^,. Thus the Ci, . . . , c^' are linearly independent. 

We would like to modify Q' so that the Ui will agree with P[ exactly. To do this, 
recall that we had the freedom to modify each of the Qij^+i by an arbitrary classical 
polynomial in Poly^^jj(V^' — )■ i(F)); this modifies the Ui by a corresponding classical 
polynomial in Poly<;^g(V^' — )■ i(F)). Because the ci, . . . , Cm' are linearly independent, the 
Ui can be so modified independently. Since P/ and Ui already only differed by such a 
classical polynomial, we can thus modify each of the Qij^+i so that the Ui are equal 
to P/ simultaneously for all 1 ^ i ^ m'. Having done so, we now see that the P/ are 
functions of Q'^^^. 

Now we extend Q'^so ^ further factor Q" by adjoining the classical polynomials 
Pi, ... , Rm" as new dimensions of degree sq and depth 0. This is still a factor; we claim 
that it remains regular. To see this, we need to show that the Pi, . . . ,Pm", together 
with the Qij with Di+j{p— 1) = sq, are linearly independent modulo Poly^j^^j br(^' ~^ 
*T). Note from construction that all such Qij must be of the form Qi.j^+i. Suppose 
for contradiction that there was a non-trivial linear dependence. As the Pi, . . . ,Rm" 
were already linearly independent modulo Poly^^^ — t- *T), this dependence must 
involve at least one of the Qj,j^+i. Multiplying by p and using Corollarv 15. 2[ we see 
that there is a dependence among those Qi^^ with Di + {Ji -|- — 1) = Sq modulo 
Poly<;so-p+i,BR(^' — )■ *T), but this contradicts the regularity of Q. 

By construction, all the polynomials P{, . . . , P^,, Pi, ... , Rm", Si, ... , Sm'" are functions 
of the regular factor Q", which has degree ^ Sq, and so Pi, ... , P^ are functions of Q" 
also. A careful inspection of the above argument also shows that each Pj was in fact an 
integer linear combination of those Q'-j in Q" with Di + j{p — 1) = di, plus a function 
of <5'^rf-_i (the cases di = sq and di < sq have to be treated separately). The claim 
follows. □ 



In view of the regularity lemma. Theorem 14.21 now follows from 

Theorem 9.7 (Exact roots, technical version). Let Sq ^ 1 he such that ER(so — 1) 
holds, and GIP(s) holds for all s ^ sq. Then ER(so) holds. 

Indeed, assuming Theorem 19. 7[ then in the situation in Theorem 14. 2 ^ we have ER(s') for 
all ^ s' ^ s by strong induction (the case ER(0) being trivial). If P G Poly^^,^]^ bi^(V^ — )■ 
*T) for some ^ s' ^ s, then by Lemma 19.61 P is a function of a regular factor P of 
degree ^ s' . We then create a depth extension R' of P by extending all the depths 
J'i so that s' < Di + J'-{p — 1) ^ s' + p — 1; the existence of such a depth exten- 
sion is guaranteed by Lemma fTTTT v) . Applying ER(s), we conclude that we can find 
Q G Poly^^/^p(V^ — )■ *T) that is a function of P' such that pQ = P. Since P' has degree 
^ s' + p — 1, Q has bounded rank, and Theorem 14.21 follows. 
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It remains to establish Theorem 19.71 To do this, we first require an equidistribution 
lemma, analogous to (but more complicated than) Lemma 17.81 

Let P = (-Pij)i<;i^m;o^i^j, be a factor with initial degrees Di, . . . , Dm- By (19. ip . the 
polynomials in P are in fact functions of P := {Pi^jji^i^m, which we interpret as a map 
from V to *T"'. 

It is now convenient to use the machinery of polynomial algebra, which is reviewed in 
Section O We define a filtration (*T™)n on *T™' by defining (*T'")fc for each standard 
natural number k to be the (finite) group generated by the elements ^j.^j+i e^ mod *Z™' 
for which 1 ^ z ^ m, ^ j ^ Jj, and k ^ Di+j{p—l), where ei, . . . , is the standard 
basis of M™. This is easily seen to be a filtration (see Example IC.3jl . We call this the 
filtration with depths Ji, . . . , Jm and initial degrees Di, . . . , Dm- 

Lemma 9.8 (Equidistribution of regular factors). Suppose that GIP(s) is true for all 
^ s ^ sq, let P = {Pij)i^i^m:o^j^j^ be a factor of degree ^ sq, and let k ^ be a 
standard integer. Set P := {Pi^Ji^i^m; and let *T™' be given the filtration with depths 
Ji, - - - , Jm and initial degrees Di, . . . , Dm- We give V the maximal degree ^ 1 filtration 
(see Example \C-3\) - 

(i) P : y — 7- is a polynomial map (see Definition \C-6\) - 

(ii) If furthermore P is regular, then P : V *T™' is equidistributed on cubes (see 
Definition\U^2^. 

Proof- To prove (i) , it suffices by Definition IC.6I to verify that 

A,,...A,,P(x) G {*T^)k (9.3) 
whenever A; G N and hi, . . - , hk, x & V- But if 1 ^ i ^ m, then we clearly have 

A,,...A,,P,,,(x)G^Z/Z 

since Pj^j^ takes values in ^^j^pj-Z/Z; and if Di + j{p — 1) < k < D^ + {j + l){p — 1) for 
some ^ j ^ Jj then 

A,,...A,,P,,,(a;) = 

and since p-^^~^Pi^j. = Pij we get 

A,,...A,,P,j,(x)G^Z/Z. 
Comparing this with the definition of (*T'")fc we obtain (19. 3p . 

Now we verify (ii), which is trickier. We need to show that the map IIK'^(P) : IIK'^(K) — )■ 
HK*^(*T'") is equidistributed. By the Weyl equidistribution criterion (Lemma IC.23p . it 
suffices to show that 

E,,h„...,H<^ve{vm\P){x, hi,---, hk))) = 0(1) (9.4) 
whenever t] : HK'=(*T™) ^ T is a non-zero homomorphism. 
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Observe that HK'=(*T'^) is a (finite) subgroup of (T™)'t°'^^''. By Pontryagin duahty, r] 
must therefore be the restriction of a homomorphism from (T'^)^'^'^^ to T. In other 
words, we can find (non-unique) integers Ci^uj for 1 ^ z ^ m and u G {0, 1}™" such that 

m 

«=i we{o,i}''- 

for all (tj,a;)i^i^m;tJG{o,i}* HK''(*T™). Iu particular, the left-hand side of f l9.4p becomes 

m 

^x,hu-,hk&veC^ ^ Ci,ujPi,jA^ + uJih + . . . + ujkhk)). (9.6) 



The coordinates ti^^j ^ *T of points (tj,a;)i^i^m;tjg{o,i}'^ 

HK*^(*T™) obey a number of 
constraints. Firstly, for each u, (ti,tj)i^i^m must lie in (*T™')o, or in other words we have 

p''^\u. = (9.7) 
whenever 1 ^ i and co G {0, 1}*^. Secondly, from Proposition IC. 16l we see that 

p'^-'j2^-iy^k.,^ = o (9.8) 

uieF 

whenever 1 ^ i ^ m, uj E {0, l}^ ^ J < J„ and F is a face in {0, 1}'^ of dimension 
greater than Di + j{p — 1). (In fact, Proposition IC.161 asserts that these are the only 
constraints on the ti^^.) 

We can use these constraints to place the coefficients Cj^^^ in a "reduced form" , as follows. 
First observe from fl9.8p that if there exist 1 ^ i ^ m, u E {0, 1}^, and ^ j ^ Jj 
with \u\ > Di+ j{p — 1) and Ci^^ ^ pJi-J or Ci^^ < then by adding a suitable multiple 
of fl9.8p to (19. Sp . one can place Cj^^^ in the interval {0, . . . — 1}, at the expense 

of changing the values of Ci^^^i for various u' with \uj'\ < \uj\. Iterating this procedure 
(starting with those u with large values of |ci;| and then working downward) we may 
assume without loss of generality that 

^ c,,^ < (9.9) 

whenever 1 ^ z ^ m, G {0, l}'^, and ^ j ^ Jj is such that > Di + j{p — 1). 

In a similar spirit, by using (19. 7p . we may assume that 
for all 1 ^ i ^ m and u G {0, l}'^. 

Since T] is non-zero, at least one of the Ci^^j is non-zero. Let G {0, 1}'^ be such that 
Cj^^, is non-zero for at least one 1 ^ i ^ m, and such that la;*! is maximal with respect 
to this property. By permutation symmetry we may assume that = l^Q^-^ = 
(1, . . . , 1, 0, . . . , 0) for some ^ K ^ k. The expression (19. 6 p can then be factored as 

K 

Ex,/ii,...,/ifeGye(Q(x + hi + ... + Hk)) JJ //(x, hi,...,hk) 

1=1 
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where Q : V *T is the expression 

m 

Q ■ ^ ^ ^i,uit Pi,Ji 
i=l 

and each /; : V^'^ — )■ *C is a hmit function bounded in magnitude by 1 and independent 
of hi. Using the second Cauchy-Schwarz-Gowers inequahty (Lemma IB.l( v)). it thus 
suffices to show that 

MQ)\\u'<iv) = o{l). 

Let I := {1 ^ i ^ m : Ci^^^^ ^ 0}, then / is non-empty. For each z G /, let ^ ji ^ Ji 
be the least integer j such that 'P'^^~^\'^i,^,- Since p'^^~^'Pij. = Pij^, we thus have 

Q = J2^^P^,n (9-10) 
for some integers that are not divisible by p. 
Let 

D := sup Di+ji{p - 1). 
iei 

As P has degree ^ Sq, we have D ^ sq. Also, since each Pjj- has degree Di + ji{p — 1), 
we see that Q has degree ^ D. 

If one had 

Di+jiip-l) < = K 
for some i G I, then from (19.91) we would have q^^, < p'^^~\ a contradiction; so we must 
have Di + ji{p — 1) ^ K for all i & I. In particular, D ^ K. By the monotonicity of 
the Gowers norms (Lemma IB. ll (ii)). it thus suffices to show that 

l|e(Q)||c/D(y) = o(l). 

Applying the induction hypothesis GIP(D — 1), it thus suffices to show that Q ^ 
Poly<;£, br(^ But, as P is regular, the polynomials Pjj. with +ji{p — 1) = D 

are linearly independent over F modulo Poly<£, bj^(V^ — )■ *T), and the polynomials Pij. 
with Di + ji{p — 1) < D already lie in Poly^^, br(^ ~^ *'^)- Since there is at least i G / 
with Di+ ji{p — l) = D, and all coefficients in (19.101) are nonzero modulo p, we obtain 
the desired claim. □ 

From the above corollary and Corollary IC.22I we conclude 

Corollary 9.9 (Polynomials on a regular factor). Suppose that GIP(s) is true for all 
^ s ^ So, let P = (-Pi,j)i^i^m;OQXJi ^ regular factor of degree ^ sq, let P : V *T"^ 
be the associated map P = {Pi,ji)i^i^m, let d ^ be an integer, and let f : *T'" — )■ *T 
be a function. Then the following are equivalent: 

• f{P) : V *T is a polynomial of degree ^ d. 

• f : — > *T zs a polynomial map (where we give *T™ the filtration with 
depths Ji, . . . , Jm and initial degrees Di, . . . , Dm, and *T the maximal degree 
^ d filtration, see Example \C.3\) . 
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We now lift the conclusion of Corollary 19.91 from *T'" to Z"^. Given natural numbers 
Di, . . . , Dm, we define the filtration (Z'^)n of initial degrees Di, . . . , Dm on Z™ by setting 
(Z'")fc, for each natural number k, to be the subgroup of Z"^ generated by those elements 
p^Cj for which 1 ^ i ^ m, j G N, and k ^ Di + j{p — 1); this is easily seen to be a 
filtration. 

Corollary 9.10 (Polynomials on a regular factor, again). Suppose that GIP(s) is true 
for all ^ s ^ So, let P = (-Pj,j)i^i^m;o^j^Ji be a regular factor of degree ^ sq and 
initial degrees Di, . . . , Dm, let d ^ be an integer. Let Q : V ^ *T be a limit function. 
Then the following are equivalent: 

• Q is measurable with respect to P, and is a polynomial of degree ^ d. 

• There exists a polynomial map f : 11^ — )■ *T from Z*" (with the filtration of 
initial degrees Di, . . . , Dm) to *T (with the maximal degree ^ d filtration), such 
that f is periodic with period p'^^'^^Ci for each 1 ^ i ^ m, and such that one has 

Q{x) = /(ai, . . .,am), (9.11) 

whenever x (zV and ai, . . . , G Z are such that 

for all 1 ^ i ^ m. 

Proof. If Q is measurable with respect to P and is a polynomial of degree ^ d, then 
by Corollary 19. 9^ we can write Q = f{P), where / : *T™ — )■ *T is a polynomial map 
from *T™' (with the filtration of initial degrees Di, . . . , Dm and depths Ji, . . . , Jm) to *T 
(with the maximal degree ^ d filtration). 

Let : Z™ ^ *T'" be the map 

0(ai, ...,am):= (^ mod 1, ... , mod 1). 

One easily verifies that this is a polynomial map (indeed, it is a filtered homomorphism) 
from Z"* (with the filtration of initial degrees Di, . . . , Dm) to (with the filtration 
of initial degrees Di, . . . , Dm and depths Ji, . . . , Jm)- Thus the function / : Z*" — > *T 
defined by / := / o is also a polynomial map from Z'" (with the filtration of initial 
degrees Di, . . . , Dm) to *T (with the maximal degree ^ d filtration). It is also periodic 
with period p'^'+^Cj for each 1 ^ i ^ m, because (f) is also periodic with these periods. 
By construction one also has (19.111) whenever fl9.12p . This proves one implication of the 
corollary. The other implication follows by reversing the above argument (noting that 
is weakly equidistributed on cubes in the sense of Definition IC.18| so that one can 
apply Lemma [C.19p . □ 

In view of the above corollary. Theorem 19.71 can now be deduced from an analogous 
result on the integer lattice Z™, which we formulate precisely as follows: 

Proposition 9.11 (Exact roots in Z™"). Let m ^ 0, Di, . . . , Dm ^ 1, and d ^ be 

standard natural numbers. Let f : Z'" *T be a polynomial map from Z"^ (with the 
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filtration of initial degrees Di, . . . , Dm) to *T (with the maximal degree ^ d filtration). 
Then we can find a polynomial map g : Z*" — *T from Z™ (with the filtration of initial 
degrees Di, . . . , Dm) to T (with the maximal degree ^ d + p — 1 filtration) such that 
pg = f ■ Furthermore, g is a linear combination (overZ) of the functions (ci, . . . , am) ^ 
mod 1 with 1 ^ i ^ m and Di+j{p—l) = d+p—1, plus a function of the expressions 
Qi mod ph>~^^ with 1 ^i and Di+j{p — l)<d + p— 1. In particular, g is periodic 
with period p^~^^ei whenever 1 ^ i ^ m and Di + j(p — 1) > d. 

We remark that in contrast with the previous arguments, the above proposition holds 
even when the Di are equal to 1 (indeed, Lemma fTTTT iii) can be viewed as a special case 
in which Di = . . . = Dm = 1 and the functions are periodic with period pZ™). 

We will prove this Proposition in the next section. For now, we show how Proposition 
19.111 implies Theorem 19.71 and thus Theorem 14. 2[ 

Proof of Theorem \9. ?| assuming Proposition \9.11[ Let Sq ^ 1 be such that ER(so — 1) 
holds, and GIP(s) holds for all s ^ Sq. Let P be a regular factor of degree at most Sq 
and some initial degrees Di, . . . , Dm, let Q e Poly<;^jj_,_^(V^ — )■ *T) be a function of P, 
and let P' = (Pjj)is;is:m;Os£iscj; be a depth extension of P with D^ + Jl{p — 1) > sq for 
all 1 ^ i ^ m. Our objective is to find a polynomial R G Poly^^_,_p(V" — *T) that is a 
function of P' such that pR = Q. 

Note from Lemma 19.31 that P' is automatically regular. 

By Corollary 19.101 we can find a function / : Z™ — )■ *T of weighted degree ^ Sq + 1 
which is periodic with period p'^'^^Ci for each i = 1, . . . ,m, such that one has 

Q{x) = f{ai, ...,am) 
whenever x G V and Cll , . . . , CLm G Z are such that 

Pi J (x) = —TTT mod 1 

for all 1 ^ 2 ^ m. 

Applying Proposition I9.11[ one can find a function g : — > *T of weighted degree ^ 
So+p which is periodic with period p-'"^^ej whenever 1 ^ i ^ m and 1) > sq + I, 

such that pg = f. In particular, g is periodic with period p'^^^^Ci for each 1 ^ i ^ m. 
We may therefore define the function P : — !■ *T by setting 

R{x) := g{ai, . . . , a^) 

whenever x G V and ai, . . . , G Z are such that 

Pi,J'ix) = —jTTT mod 1 

P 

for all 1 ^ z ^ m; the periodicity properties of g ensure that R is well-defined. By 

Corollary 19.10^ P is a function of P' which is a polynomial of degree ^ Sq + p. By 

construction, one has pR = Q, and the claim follows. □ 
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The only remaining task is to establish Proposition 19.111 This will be the subject of 
the next section. 



10. Exact roots for polynomials on Z*" 

We now prove Proposition 19.111 Throughout this section the dimension m ^ and 
initial degrees Di, . . . , Dm ^ 1 are fixed. 

It is convenient to rephrase the polynomiality condition in terms of derivatives. Define 
a basic generator v to be an element of Z™ of the form p'Ci, where 1 ^ i ^ m and j ^ 0. 
Define a multigenerator to be a tuple v = {vi, . . . ,Vr) of basic generators, where r ^ is 
a standard natural number. We associate to each basic generator p'Ci a weighted degree 
deg(]>'ej) := + j{p — 1), and associate to each multigenerator v = {vi,...,Vr) a 
weighted degree deg(-y) := deg(f i) + . . . + deg(fr). We also associate to v the differential 
operator 

We say that a function / : Z™ — )■ *T has weighted degree ^ d if one has A^/ = 
whenever deg(w) > d. In other words, we have 

m oo 

(nn^%)/=o 

i=l j=0 

whenever natural numbers (at most finitely many of which are non-zero) with 

Er=iEr=o«M(A+j(p-i))>rf- 

From Proposition IC.IOI we have 

Proposition 10.1 (Differential characterisation of polynomiality). Let f : Z™ — t- *T 

be a function, and let d ^ be a standard natural number. Then the following are 
equivalent: 

(i) f is a polynomial map from Z™ ( with the filtration of initial degrees Di, . . . , Dm) 
to *T (with the maximal degree ^ d filtration). 

(ii) / has weighted degree ^ d. 

One nice feature of a weighted bounded degree polynomials is that they have some 
periodicity properties: 

Lemma 10.2 (Periodicity properties). Let f : Z"^ — > *T be of weighted degree ^ d. 

(i) For any 1 ^ i ^ m, f is periodic with period p'ci whenever 1 ^ i ^ m and 
j ^ are such that Di + j{p — 1) > d. 

(ii) f is a linear combination (over 'L) of the functions (ai, . . . , a^) i— )■ mod 1 
with 1 ^ i ^ m and Di + j(p — 1) = d, plus a function of the expressions 
ai mod p'^'^ with 1 ^ i ^ m and Di + j{p — 1) < d. 
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Proof. If Di + j{p — 1) > d then ^pjej = by Proposition 110.11 This proves (i). 

Now we turn to (ii). Let / denote the set of all 1 ^ z ^ m for which there is a natural 
number jj for which Di + jiij) — 1) = d. If i G /, then A^j^g./ has weighted degree ^ 
and is thus constant. On the other hand, by (i), / is periodic with period p^'+^ej. We 
conclude that /^pj^^J = Cj/p for some q G {0, . . . ,p-l}. Write g := Xljg/ Cj^^ mod 1, 
then we see that g is periodic with period p^'Ci for each i G /, and thus (by (i)), is also 
periodic with periodic p^Ci whenever 1 ^ m and Di + j{p — 1) < d. □ 

From Proposition 110.11 and Lemma 110.21 we may thus rephrase Proposition 19.111 as 
follows: 

Proposition 10.3 (Exact roots in Z™, again). Let d ^ be standard. Let f : Z"^ — )• *T 
be a map of weighted degree ^ d. Then we can find a map g : Z"^ — > *T of weighted 
degree ^ d + p — 1 such that pg = f ■ 

In order to prove Proposition 110. 3[ we use the following explicit description of those 
functions / : Z™ — )■ *T of a given weighted degree, which generalises Lemma [1.7( iii): 

Proposition 10.4 (Classification of polynomials). Let d ^ be standard, and let f : 
Z™ — 7- *T 6e a map. Then the following are equivalent: 

(i) / has weighted degree ^ d. 

(ii) / can be expressed as 

il, . . . ,im 3: 0; r > : 

for some a E *T and integers Ci^^,„^i^^^r- 

Proposition 110.41 immediately implies Proposition 110. 3[ since any element a G *T has 
a p^^ root, and any "monomial" '^'^'V+r''' (i^) • • • (i"") of degree ^ d has a p^^ root 
^l^O---(t) of degree ^d + p-L " 

Proof. We first show that (ii) implies (i). As each constant function a clearly is of 
degree ^ d, it suffices by linearity to show that the multinomial 

M:(a:„...,xJ.^^(^;)...(^2)modl 

has weighted degree ^ (S^i ^jh) + ^(P ~ 1) fo'^ H, ■ ■ ■ ,im ^ and r G Z. (For 
inductive reasons we include the case when r is negative, but the claim is trivial in those 
cases as the multinomial M then vanishes modulo 1.) 

We prove this by induction on the weighted degree dM '■= Cl2JLi Djij) +r{p— 1). When 
dM ^ or less the claim is trivial, so suppose that dM is positive, and that the claim 
has already been proven for smaller values of dM- 
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To show that M has weighted degree ^ du, it then suffices to show that Apfeg_^.M has 
weighted degree ^ (Im ~ Dj — k{p — 1) for each 1 ^ j ^ m and k ^ 0. For sake of 
argument we shall just verify this when j = m, though the other cases are of course 
similar. From the binomial identity f l6.5p one has 

A,., Mix,, • • • , = g ^ . . . ( J: ^ (^^ ) mod 1, 



so it will suffice to show that each term 

^(::)-C:::OC>°- 

with 1 ^ I ^ im has weighted degree ^ (Im — Dm — k{p — 1) . 

Fix I. We may assume that / ^ p'^, since the binomial coefficient (^^ ) vanishes otherwise. 
Let t be the largest natural number such that divides /, then t ^ k. Inspecting the 
binomial coefficient 

^pk\ pk pk _ _ 2 pk _ I _^ I 



I J I I 2 I -I 

we see that p^^* divides (^^ ) . Absorbing this factor into the -f+i term in fllO.ip and 
using the induction hypothesis, we conclude that (110. ip has weighted degree 

m— 1 

^ Djij) + Dm{im -l) + {r-k + t){p- 1). 
i=i 

But note that 

Dm{l-1) > {l-l)^p' -l>t{p-l), 

and the claim follows. 

Next, we show that (i) implies (ii). This claim is trivial for d ^ 0, so suppose inductively 
that d > and that the claim has been proven for smaller values of d. We then fix d 
and assume as a second induction hypothesis that the claim has already been proven 
for smaller dimensions than m. We may assume that m > 0, since the m = case is 
trivial. 

Let / be of weighted degree ^ d, and consider the derivative Ag^/. By (i), this function 
has weighted degree ^ d—Dm, and thus by the induction hypothesis has a representation 
of the form 

Jl, . . . , «m 3= 0; r > : 



We now introduce the function 



il,. . . ,im 3! 0; r ^ : 

(Er=i D,ij) + r(p - 1) d - D„ 
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As (ii) implies (i), we know that g has weighted degree ^ d. From Pascal's identity we 
have 



A.^^(x„ . . . , x^) = [-J ■ ■ ■ ( J ) 1 

i\, . . . ,im ^ 0, r ^ : 



and thus 
We thus have 



Ae^/ = a + Ae^g. 



fi^Xi, . . . , XfYi) OiXfiYi + f(yXi, . . . , Xfn—i, 0) -|- g{X\^ ■ • • 1 •^m)- 

As / has weighted degree ^ d, the m — 1-dimensional function [xi, . . . ,Xm-i) ^ 
/(xi, . . . , Xm-i, 0) does also. By the second induction hypothesis, /(xi, . . . , Xm-i, 0) is 
already of the required form for (ii), while g is also of the required form by construction. 
It remains to show that the term aXm has the required form. 

By linearity, of weighted degree ^ d. By Lemma [10.2^ we thus see that jPa = Q 

whenever D^n + j(p — 1) > d. If we thus let j be the first natural number for which 
Dm + j{p — I) > d, then ax^ is a multiple of ^(^™)' (^^) i^^pli^s (i), this has degree 
^ Dm + (j — — 1) ^ d, and the claim follows. □ 



The proof of Proposition 110.31 (and thus Proposition 19. lip is now complete. 



11. Deducing the inverse conjecture from the inverse conjecture for 

polynomials 



In this section we deduce Theorem 11.111 from Theorem ll.20[ This deduction can be 
done in either a finitary or an infinitary setting. In the finitary setting, one uses struc- 
tural decomposition theorems as in [10], [13], [22], [E], [IS]; the arguments in [TB] are 
particularly close to those here. In the infinitary setting one can proceed by analogous 
decomposition theorems based on conditional expectation. We shall follow the latter 
approach here, in order to illustrate the parallel nature of the two arguments. (This 
latter approach is also adopted in [37].) 

We first give a general abstract structural decomposition. 

Lemma 11.1 (Decomposition). Let V be a limit finite set, and let S be a family of 
limit functions P : V ^ X on V , each of which takes only a finite number of values. 
(We do not assume that S is itself a limit set.) Let f : V *C be a limit function 
bounded in magnitude by some standard real A. Then one can decompose 

f /str ~l~ /psd 

where /str, /psd '■ V ^ *C are limit functions bounded in magnitude by A and 2A respec- 
tively, with the following properties: 
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• r/str almost structured) For every standard e > 0, one can find a function : 
y — >■ *C that is a function of boundedly many functions from S, such that f^ is 
boundedin magnitude by A and \\fstr-fs\\L^v) '■= {'^xev\fstr{x)-fe{x)\'^y/^ ^ e. 

• r/psd pseudorandom) For every function g : V ^ *C that depends on only 
boundedly many functions from S, one has {fpsd, 9) l^{v) '■— ^xevfpsd{x)9{x) — 

0(1). 

Proof Given any finite subset Sq of S, let B{Sq) be the cr-algebra of V generated by 
the level sets of the functions of S^; this is a finite cr-algebra, with every atom being a 
Umit subset of V. Given such a cr-algebra, we can define the conditional expectation 
E{f\B{So)) : 1/ ^ *C of / by the formula 

E{f\B{So)){x) - E,eB(5o)(.)/(y) 

for all X eV, where B{So){x) is the atom of B{So) that contains x. Clearly E{f\B{So)) 
is bounded in magnitude by A, and the energy 

is a non- negative real number between and A'^. Let i?max denote the suprcmum of 
£(iSo); as the energy is monotone in Sq we can thus (using the axiom of choice) find an 
increasing sequence 5„ for n G N such that S{Sn) — )■ -Emax- 

Prom Pythagoras' theorem we have 

st||E(/|B(5„0 - E(/|B(5„)||i2(^) = £{Sn') - £:(5„) 

for any n' > n, where six denotes the standard part of the limit real x. Thus the 
E(/|B(«Sn) are an Cauchy sequence in the sense that 

hm st||E(/|B(5„0) - E(f\B(Snml.^v) = 

We claim that this implies the existence of a limit function /gtj. : V — )■ C, bounded in 
magnitude by A, such that 

hm st||/str-E(/|i3(5„))||i2(v.) = 0. 

Indeed, if we write V = Ua-^a^ f = linia^aoo fa, and cS„ = Ua^a^o 
set 

/,tr := lim E{f^\B{Sn^,a)) 

and the claim will follow if Ua increases to infinity at a sufficiently slow rate; we omit 
the routine details. 

Now let g depend on a bounded number S' of functions from S, such that g is bounded 
in magnitude by B. Then for any standard natural n, one can rewrite 

st{f-E{f\BiSn)),g)mv) 

as 

st(E(/|^(5„ U S')) - Eif\BiSn)),g)v^^v) 
which by the Gauchy-Schwarz inequality is bounded in magnitude by 

B\\E{f\B{S^ U S')) - E{f\B{SMLHv) 
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which by Pythagoras' theorem and definition of -Emax is bounded by 



1/2 



max 



and thus 



hm st{f-E{f\B{S^)),g)mv) = 0. 



Taking hmits using the Cauchy-Schwarz and triangle inequahties, we conclude that 

lim st(/ - fstr,9)L2(v) = 0. 



Remark 11.2. Although we will not need this fact here, it is often useful to observe 
that if / is non-negative, then /str and can be taken to be non-negative also. One can 
also establish this lemma using the machinery of Loeb measure [32]: if n is Loeb measure 
on V, then /gtr is essentially the conditional expectation (in L'^{fi)) of / with respect 
to the cr-algebra generated by S. See [2Z] for an implementation of this approach (and 
jH] for some further discussion of the role of Loeb measure in the nonstandard version 
of the Gowers norms). 

Next, we recall one of the main theorems from [3], phrased in the ultralimit setting: 

Theorem 11.3 (Weak inverse Gowers conjecture). Let s be standard. Then there 
exists a standard integer d = d{s,p) ^ such that for every limit finite- dimensional 
vector space V and every bounded limit function f : V ^ *C with \\f\\u''+'>-{v) ^ 1; there 
exists P e Poly^rf(\/ ^ *T) such that |E^ei//(x)e(-P(x))| > 1. 

Proof. See [3^ Corollary 1.23]. The translation to the ultralimit setting proceeds exactly 
as in Section Ei □ 

We are now ready to deduce Theorem 11.111 from Theorem 11.201 Fix and let d be 
the minimal integer for which Theorem 111.31 holds for this value of If d ^ s, then 
we are done by Theorem 13.21 so suppose for sake of contradiction that d> s. 

By construction of we can find a limit finite-dimensional vector space V and a bounded 
limit function / : — )■ *C such that ||/||(7s+i(y) ^ 1, but such that 



whenever P is a polynomial of degree ^ d—1. By Fourier analysis (and Lemma [TTTT vi) ) . 
this implies that 



whenever (7 is a function of a bounded number of polynomials of degree ^ d — 1. In 
particular we have 



Setting /psd '■= f — /str, we now obtain the claim. 



□ 



(/,e(P))L.(v) = o(l) 



{f,9)L^v) = 0(1) 
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Using Lemma 111.11 (and Lemma ll.7( vi)). we can decompose / = fstr + /psd, where 
/strj /psd : ^ — )■ *C are bounded hmit functions, we have 



whenever g is a function of a bounded number of polynomials of degree ^ d, and for 
every £ > we can approximate fstr by a bounded limit function that is a function 
of a bounded number of polynomials of degree ^ d. In particular, by Theorem lll.3[ 



whenever P is a polynomial of degree ^ d of bounded rank. 

As the /str — fe are uniformly bounded in e, and have (the standard part of the) 
norm going to zero as £ — ?■ 0, we see (using Lemma [B.l( iii)) that 



uniformly in e. 

By Fourier analysis, we can express fe as a bounded linear combination of phases e(P), 
where the P are polynomials of degree ^ d. We separate fe = f'e + fe, where is a 
linear combination of phases e(P) of unbounded rank, and f^' is a linear combination 
of phases of e(P) bounded rank. 

From Theorem 11.201 (and Theorem 13. ip we see that ||e(P)||[/d(y) = o(l) whenever P 
has unbounded rank, and in particular (by Lemma fB.ir ii)) ||e(P)||[/s+i(y) = o(l) and 
E3.£ye(P(x)) = o(l). Since the difference of a degree d polynomial of unbounded rank 
and a degree d polynomial of bounded rank remains of unbounded rank, we also have 
Ej.gye(P(a;) — Q{x)) = o(l) whenever Q is of bounded rank. We conclude that /g, 
are essentially orthogonal in the sense that 



{fpsd, g) L^iv) = o(i) 




and hence by the triangle inequality for U^~^^{V) (Lemma IB. ll (i)) one has 

II /str II V) > 1- 

Also from the triangle inequality we see that 

(/str,e(P))L2(v) = 0(1) 



(11.1) 



Stjj/str - /e||l/»+i(V) 

as £ — > 0. For all sufficiently small e > 0, we thus have 

||/e||i7=+i(V) > 1 



(11.2) 



(/^,/^')L^(y)=0(l) 



and hence 



\\mlHV) = {feJ'e')LHV) + 0{l). 




as £ — )■ 0. We conclude that 



st|i/;iiLw^o 
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as e — )■ 0, which in particular imphes (by Lemma [B.l( iii)) that 

st||/e'lll/»+i(V) 

as e — )■ 0. Also, as is a bounded linear combination of e(P) for P of unbounded rank, 
and thus of infinitesimal U'^^^iV) norm, we see from the triangle inequality (Lemma 
EIKi)) that 

\\fe\\u^+^{V) = 0{1) 

and hence 

st||A||c/-+i(y) 

as £ — )■ 0, contradicting flll.2p . This concludes the deduction of Theorem 11.111 from 
Theorem ll.2U[ 



Appendix A. Basic theory of ultralimits 
In this appendix we review the machinery of ultralimits. 

We will assume the existence of a standard universe il which contains all the objects 
and spaces of interest for Theorem 11.111 or Theorem 11.201 such as the natural num- 
bers, standard finite-dimensional vector spaces F" and their elements, the unit circle T 
and its elements, functions from the former spaces to the latter (such as polynomials 
P G Poly^^(F"' —)■¥)), and so forth. The precise construction of this universe is not 
important, so long as it forms a set. We refer to objects and spaces inside the standard 
universe as standard objects and standard spaces, with the latter being sets whose ele- 
ments are in the former category. Thus for instance, elements of N are standard natural 
numbers, and for every standard natural number n, is a standard finite-dimensional 
vector space. Strictly speaking, the universe il cannot contain all finite-dimensional 
vector spaces, as the class of such spaces is not a set, but for the purposes of proving 
Theorem II. Ill or Theorem 1 1 . 2 1 we only need to pick one representative of each isomor- 
phism class of such spaces, such as F" for n = 0,1,2,..., and these certainly form a 
set. 

The one technical ingredient we need is the following: 

Lemma A.l (Ultrafilter lemma). There exists a collection a of subsets of the natural 
numbers N with the following properties: 

(i) (Monotonicity) If A & a and B D A, then B & a. 

(ii) ( Closure under intersection) If A,B G a, then A (1 B & a. 

(iii) (Maximality) If A G N, then either A G a or N\A G a, but not both. 

(iv) (Non-principality) If A G a, and A' is formed from A by adding or deleting 
finitely many elements to or from A, then A' G a. 



Proof. The collection of subsets of N which are cofinite (i.e. whose complement is 
finite) already obeys the monotonicity, closure under intersection, and non-principality 
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properties. Using Zorn's lemma|^, one can enlarge this collection to a maximal collection, 
which then obeys all the required properties. □ 

Throughout the paper, we fix a non-principal ultrafilter a. A property -P(n) depending 
on a natural number n is said to hold /or n sufficiently close to a if the set of n for 
which P(n) holds lies in a. 

Once we have fixed this ultrafilter, we can now define limit objects and spaces: 

Definition A. 2 (Limit objects). Given a sequence (a;n)nGN of standard objects in H, 
we define their ultralimit limn^aXn to be the equivalence class of all sequences (|/n)nGN 
of standard objects in il such that = Vn for n sufficiently close to a. Note that the 
ultralimit limn_^o can also be defined even if x^ is only defined for n sufficiently close 
to a. 

An ultralimit of standard natural numbers is known as a limit natural number, an 
ultralimit of standard real numbers is known as a limit real number, etc. 

For any standard object x, we identify x with its own ultralimit limn-s.^ x. Thus, every 
standard natural number is a limit natural number, etc. 

Any operation or relation on standard objects can be extended to limit objects in the 
obvious manner. For instance, the sum of two limit real numbers limn_j.aXn, limn^„?/n 
is the limit real number 

lim Xn + lim ?/„ = hm x^ + y-a, 

n— >a n— n— >a 

and the statement limn^Q, x^ < limn^Q Un means that x^ < y^ for all n sufficiently close 
to a. 

Remark A. 3. A famous theorem of Los asserts that any statement in first-order logic 
which is true about standard objects, is automatically true for limit objects as well. For 
instance, the standard real numbers form an ordered field, and so the limit real numbers 
do also, because the axioms of an ordered field can be phrased in first-order logic. We 
will use this theorem in the sequel without further comment. 

Definition A. 4 (Limit spaces and functions). Let (Xn)ngp^ be a sequence of standard 
spaces Xn in il indexed by the natural numbers. The ultraproduct Yln^a 
is defined to be the space of all ultralimits lim^^a ^n, where x^ € -^n for all n. Note 
only needs to be well-defined for n sufficiently close to a in order for the ultraproduct 
to be well-defined. If X is a set, the set Iln^Q"^ known as the ultrapower of X and 

■^By using this lemma, our results thus rely on the axiom of choice, which we will of course assume 
throughout this paper. On the other hand, it is possible to rephrase Theorem 11.111 and Theorem 11.201 
in the language of Peano arithmetic. Applying a famous theorem of Godel[TD], we then conclude that 
Theorem 11.201 is provable in ZFC if and only if it is provable in ZF. In fact, it is possible (with some 
effort) to directly translate these ultrafilter arguments to a (lengthier) argument in which ultrafilters 
or the axiom of choice is not used. We will not do so here, though, as the translation is quite tedious. 
In particular, the regularity lemma and equidistribution arguments in this paper will become messier, 
resembling those that appear in [52] or |28) . 
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is denoted *X. Thus for instance *N is the space of all limit natural numbers, *M is the 
space of all limit reals, etc. 

We define a limit set to be an ultraproduct of sets, a limit group to be an ultraproduct 
of groups, a limit finite set to be an ultraproduct of finite sets, and so forth. A limit 
subset of a limit set X = Hn-j-a^n ^ limit set of the form Y = Hn-j^a^n' where 
is a standard subset of Xn for all n sufficiently close to a. 

Given a sequence of standard functions : X,^ — > Y^ between standard sets Xn, Y^, 
we can form the ultralimit f = lim^^a fn to be the function / : Hn^a ~^ Iln^a 
defined by the formula 

/(lim Xn) := lim fn{xn). 

n— >a n— >-a 

We refer to / as a limit function or limit map. 

Remark A. 5. In the nonstandard analysis literature, limit natural numbers are known 
as nonstandard natural numbers, limit sets arc known as internal sets, and limit func- 
tions are known as internal functions. We have chosen the limit terminology instead as 
we believe that it is less confusing and emphasises the role of ultralimits in the subject. 

It is important to note that not every subset of a limit set is again a limit set, for instance 
N is not a limit subset of *N (this fact is known as the overspill principle). Indeed, one 
can think of the fimit subsets of a limit set as being analogous to the measurable subsets 
of a measure space. In a similar vein, not every function between two limit sets is a 
limit function; in this regard, limit functions are analogous to measurable functions. 
This analogy can be deepened by using the theory of Loeb measures, but we will avoid 
using this machinery here. 

A. 6. Asymptotic notation. By taking ultralimits, one can formalise asymptotic no- 
tation, such as the 0{) notation, in a manner that requires no additional quantifiers: 

Definition A. 7 (Asymptotic notation). A limit complex number X is said to be 
bounded if one has \X\ ^ C for some standard real number C, in which case we also 
write X — (9(1) or \X\ <S 1. More generally, given a fimit complex number X and 
limit non-negative number Y, we write |X| Y , Y ^ \X\, or X = 0{Y) if one has 
|X| ^ CY for some standard real number C. We write X = o{Y) if one has |X| ^ eY 
for every standard e > 0. Observe that for any X, Y with Y positive, one has either 
|X| ^ Y or X — o{Y). We say that X is infinitesimal ii X — o(l), and unbounded if 
1/X = o(l). Thus for instance any limit complex number X will either be bounded or 
unbounded. 

Example A. 8. The limit real limn->.a 1/n defines an infinitesimal, but non-zero, limit 
real number x; its reciprocal limn_^cK ^ is an unbounded limit real. 

Prom the Bolzano- Weierstrass theorem, every bounded limit complex number x can be 
expressed uniquely as the sum of a standard real number st{x) and an infinitesimal 
X — st{x); we refer to st(a;) as the standard part of x. 
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Appendix B. Properties of the Gowers norms 

In this appendix we record some basic properties of the Gowers norms. We use the 
normahsed norms 

||/IU.(G) :=(E.,G|/(a:)n^/^ 
for any finite non-empty set G and any f : G ^ C. 

Lemma B.l. Let G = {G, +) be a finite ahelian group, and let 1 he an integer. 

(i) The Gowers norm is a norm on functions f : G ^ C for d ^ 2, and a 
semi-norm for d = 1. In particular, we have the Gowers triangle inequahty 

\\f + gWu-iiG) ^ ll/llc/d(G) + ll^llc/rf(G) 

for f,g:G^C. 

(ii) One has the monotonicity property 

for all f : G ^ C In particular 

\^.evfix)\ = \\f\\uHG)^\\f\\uHGy 

(iii) One has the bound 

Il/||;7''(G) ^ ll/|lL2''/{d+i)(G) 

for allf-.G^C. 

(iv) One has the first Cauchy-Schwarz-Gowers inequahty 

\^xM,-,hd(^G Yl f + ^ihl + ■■■+ U)dhd)\ ^ Yl Wf^WuHG) 

we{o,i}d wG{o,i}'* 

for all {0, l^-tuples (/w)i^e{o,i}'* of functions f^^ : G ^ C, where oj := (ci;i, . . . ,0;^). 

(v) One has the second Cauchy-Schwarz-Gowers inequahty 

d 

|Ea;i,...,XdGG/(a^i + • • • + JJ Fj{xi, ...,Xd)\^ \\f\\u^{G) 

i=i 

for all f : G ^ C and Fj : — )■ C, if each Fj is bounded in magnitude by 1 
and is independent of the Xj variable. 

(vi) IfPe Poly^rf_i(G' ^T) and f -.G^C then 

ll/e(P)||c/.(G) = ll/llt/-(G)- (B.l) 

Proof. Claim (i) is proven in [121 Lemma 3.9], [17, Section 5.1] or Section 11.1]. 
Claim (ii) is proven in [121 Section 11.1]. Claim (iii) (which is also [121 Exercise 11.1.13]) 
follows easily from the recursive formula 

and induction on d, together with the special case 

(KhevWfi- + ^)fl'(-)ll^2d/(d+l)(y))^'^^ ^ ll/llL2''+V(d+2)(y)||fi'|li2d+l/(d+2)(y) 
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of Young's convolution inequality, which follows from the more traditional instance 
of that inequality by setting F := |/|2^/('^+i) and G := 

Claim (iv) is proven in Lemma 3.8], [TTl Section 5.1] or |l2l Section 11.1]; Claim 
(v) is proven in [191 Appendix B]. Claim (vi) follows immediately from the identity 
A;,, . . . AhAP) = 1 for all /ii, . . . , /id e □ 

Appendix C. Polynomial algebra 

In this appendix we review the general theory of polynomial maps (and related objects, 
such as cubes) on (filtered) groups that are not necessarily abelian (in particular, they 
may be nilpotent). This theory was initiated by Lazard [2^ and Leibman [3D], [21] 
(inspired in part by the classical Hall-Petresco formula [25j, ^34j), and further developed 
by Host and Kra [26] , [27] and by Green and the authors [H] , [21] . Our discussion here 
is largely drawn from the paper |24j . 

Polynomial algebra works on both multiplicative groups G = {G, ■) and on additive 
groups G = (G, +). For sake of concreteness we shall set out the theory here using 
multiplicative group notation, but one can of course adapt all the definitions here to 
additive groups in an obvious manner, and in fact most of the applications of this theory 
in this paper will be in the additive setting. Our conventions will be that additive groups 
are always understood to be abelian, whereas multiplicative groups are not necessarily 
abelian. 

The concepts here can be defined both in the standard and nonstandard setting, but 
again for concreteness we shall work purely in the standard universe in this appendix. 
But all of the results here can easily be phrased in the language of first-order logic 
(they involve only finitely many quantifiers) and so extend without difficulty to the 
nonstandard universe also. 

C.l. The category of filtered groups. The theory of polynomial maps is most nat- 
urally expressed in terms of a certain category of filtered groups, which we will now 
define. 

Definition C.2 (Filtered group). A filtered group G = {G, Gn) is a multiplicative group 
G = {G, ■), together with a nested sequence 

G^Go^Gi^ ... 

of subgroups Gn = (G„)ngN, obeying the commutator relation [Gi, Gj] C Gi+j for all 
i,j E N, where [Gi,Gj] is the group generated by the commutators Qi^gJ^gigj with 
gi E Gi, gj G Gj. We refer to Gfq as a filtration of G. 

A filtered group is said to have degree ^ s for some natural number s if Gi is trivial for 
all i > s. 
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Example C.3 (Abelian case). When the group G = (G, +) is additive (and thus 
abehan), a filtered group is simply a nested sequence G ^ Gq ^ Gi ^ . . . of subspaces 
(since the commutator relation is automatic in this case). In particular, for any natural 
number k ^ 0, one can give any additive group G the maximal degree k filtration 
= G\^\ defined by setting Gi equal to G when i ^ k and Gi = {0} for i > k. 

Example C.4 (Lower central series). Any group G can become a filtered group by 
taking Gi to be the lower central series of G, thus Go = Gi := G and Gj+i := [G, Gi] 
for i ^ 1. 

Remark C.5. In this paper we will only filter groups G by the natural numbers N. 
However it is sometimes convenient to filter groups by other sets, such as N^, in order to 
develop a theory of "multidegree" for polynomials of several variables; see [23]. While 
one could use this notation to describe the multilinear maps that arise in this paper, 
we have chosen not to do so here in order not to add even more terminology to what is 
already quite a notation- intensive argument. 

An obvious way to make the class of all filtered groups a category is to use the filtered 
homomorphisms (p : H ^ G between two filtered groups H = {H,Hf^), G = (G, G^), 
defined as a group homomorphism from H to G that maps each Hi to Gj. However, this 
turns out to be too small a class of morphisms for our purposes, and we will need to use 
instead the larger class of polynomial maps between two filtered groups. This concept 
can be defined in a number of different ways. The quickest way is via differentiation: 

Definition C.6 (Polynomial maps via differentiation). Let H = {H, Hj^), G = (G, Gn) 
be filtered groups, and let : — i- G be a map. For any h & H,we define the derivative 
dh4> : H ^ G oi (j) m the direction h by the formula 

dh4>{x) := (l){hx)(j){x)~^ . 

We say that the map —t- G is a polynomial map if one has 

• • • dh^(f){x) e Gi,+,„+i^ 

whenever m ^ and ii, . . . ,im G N, and hj G Hi. for all 1 ^ j ^ m. The space of all 
polynomial maps from H to G will be denoted Poly (if — ?■ G). 

Example C.7 (Non-classical polynomials as polynomial maps). If V,G are additive 
groups, with V given the maximal degree ^ 1 filtration, and G the maximal degree 
^ k filtration for some k E N, then Poly(\^ — )■ G) corresponds precisely to the space 
Poly<;fc(V —7- G) defined in Definition 11.21 (this definition was for finite-dimensional 
vector spaces V, but the definition clearly also makes sense for other additive groups). 
In particular, a non-classical polynomial P : V ^ T of degree ^ A; is also a polynomial 
map from V (with the maximal degree ^ 1 filtration) to T (with the maximal degree 
^ k filtration). 

Example C.8. Every filtered homomorphism is a polynomial map. For any g E G, the 
left translation maps x ^ gx and right translation maps x ^ gx are polynomial maps 
from G to itself. 

Remark C.9. A basic theorem of Lazard and Leibman [21], [30], [21] asserts that 
Poly(if — )■ G) is a group; see e.g. [231 Corollary B.ll]. This generalises the (obvious) 
fact that Po\j^f.{V — )■ G) is a group in the additive case. 
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A convenient fact about polynomiality is that it suffices to check it on generators: 

Proposition C.IO (Checking polynomiahty on generators). Let H, G be filtered groups, 
and for each i G N, let Ei be a set of generators for Hi. Then a map (p : H ^ G is 
polynomial if and only if 

whenever m ^ and ii, . . . , ^ and hj G Ei^ for all 1 ^ j ^ m. 

Proof See [21 Proposition B.17]. □ 

It is not immediately obvious from Definition IC.6I that the polynomial maps turn the 
class of filtered groups into a category, because one has to show that the composition 
of two polynomial maps is still polynomial. However, this can be achieved via the 
machinery of Host-Kra cube groups, which we now pause to define. 

Definition C.ll (Host-Kra cube group). Let G = {G, Gn) be a ffitered group. For any 
A; G N, we define the k'^ Host-Kra cube group BK''{G) = HK'=(G,Gn) of this ffitration 
to be the subgroup of G^^'^^ generated by those elements (fi'aj)ajG{o,i}'= which take the 
form = g ioT u & F and g^ = id otherwise, where F is a face of {0, 1}''' of some 
codimension i, and g is an element of Gj. Elements {guj)we{o,i}'= of HK'''(G) will be 
referred to as k-dimensional cu beg in G. 

We have an alternate description of these groups via a "Taylor expansion" : 

Proposition C.12. Let G = {G, G^) be a filtered group, let k & N, and let g G G^'^'^^^ . 
Then g G HK'^(G) if and only if there exist "Taylor coefficients" gj G G|j| for each 
subset J C {1, . . . , A;} such that 

JC{1,...,A:} JC{1,...,A;} 

where the subsets of {l,...,k} are ordered lexicographically (i.e. J < J' whenever 
^^.gj2~-^ < XljeJ' ^ Furthermore, the gj are determined uniquely by g. 

Proof. See [2T| Lemma 6.4]. □ 

Thus, for instance, HK^(G) consists of all tuples of the form {g^Q, googoi, 5'oo5'io, 5'oo5'oi5'io5'ii), 
where goo G Go, goi,9w e Gi, and gu G G2. 

Theorem C.13 (Polynomial maps via cubes). Let G = (G, G^) and H = {H,Hfq) be 
filtered groups, and let (p : H ^ G be a map. Then cf) is a polynomial map if and only if 
preserves cubes, in the sense that for any A; G N and {h^)^^iz{o,iY ^ HK'^(iJ), the tuple 
(0(/i..))^G{o,i}fc ^«es m HK'=(G). (In other words, the map : i/iO'i}' ^ ^{0,1}'= 

maps HK'=(i7) to HK'=(G).; 

^These cubes are also referred to as parallelopipeds in some literature, e.g. [2 7) . 
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As an immediate corollary of this theorem, we see that the composition of two polyno- 
mial maps is again polynomial, and so the class of filtered maps is now a category. If 

: i7 ^ G is a polynomial map and k e N, we use HK'=(0) : HK'=(/7) ^ HK'=(G') to 
denote the restriction of : //iO'i)' ^ G^^'i^" to HK^(i/) and HK^(G). 

Proof. When H is additive, this theorem was proven in Proposition 6.5]. We will 
give an alternate proof based on [2^ Theorem B.IO]. To use this theorem, we need a 
generalisation of the Host-Kra groups. For any natural numbers ii, . . . ,im G N, define 
the Host-Kra group HK*^' ' '*'=(G') of a filtered group G to be the subgroup of G^^'^^ 
generated by the elements of the form 

(fl'aj)ajG{0,l}'=) 

where uq G {0,1}'^, g^j^ G Gy . i , and gi_j equals g^^ when uj ^ {^o)j for all 

1 ^ J ^ ^, and is the identity otherwise. Thus for instance, when ii = . . . = = 1, 
then HK^- '^(G) = HK'=(G). It is easy to adapt the proof of Proposition IC.12I to see 
that elements of HK*^ (G) are precisely those tuples of the form fIC.ip . where each 
Taylor coefficient gj now lies in Gj2-^jij rather than G\j\. 

The result [211 Theorem B.IO] asserts that is a polynomial map if and only if (p (or 
more precisely, 0®^°'^}'°) maps HK*^'-'*'^(/f) to HK*i'-'*'=(G) for every k,ii,...,if: G N. 
In view of this result, to prove Theorem IC.13| it suffices to show that if (p maps 
to HK'=(G), then it maps HK*^'-'^'=(if) to HK^i'-'^'=(G) for every A;, ii, . . . , G N. 

For any k,ii, . . . ,ik G N. Let P{ii, . . . denote the assertion that 0®^°'^}^* maps 
HK*i' -'^'=(i7) to HK*i' By hypothesis, P(l, . . . , 1) is true for any number of I's; 

our task is to then show that P{ii, . . . ,ik) is true in general. The case A; = is trivial, 
so we may assume that k ^ 1. 

Suppose first that = 0. An inspection of the definition then shows that HK*^'"''*''"^'°(G) = 
IIK*^' - '*'=-i(G') X IIK*^' ' '**^-^(G), and similarly for H. As a consequence, we see that 
P(i 1, . . . , ik-i, 0) is implied by P{ii, . . . , ik-i)- From this observation we may assume 
without loss of generality that ik ^ 1- By symmetry, we may in fact assume that ij ^ 1 
for j = 1, . . . , k. 

Consider the map rj : ^{0.1}^- ^ G^O'i}'-''''" defined by 

v{{9ui)ue{o,i}'') '■= {guii,...,uik-iMniuik,-,^k+zf^-i))uje{o,i}'=+'k-^- (C-2) 
This is clearly an injective group homomorphism when ik ^ 1. We claim that 

^(HK^i'-'^HG)) = r/(G^°'^>') n HK^i'-'*'=+»fc-i(G), (C.3) 

where ij is equal to ij for j < k and equal to 1 for < j ^ A; + — 1. It is easy 
to see that the left-hand group in (1C.3P is included in the right-hand side, simply by 
checking what rj does to each generator of HK*^' ' '*''(G). The reverse inclusion is a little 
trickier. Suppose that g G G^^'^^'' is such that r]{g) lies in HK*i'-"'*'=+'fc-i(G). From f lC.ip 
and induction, we see that the Taylor coefficients 't]{g)j of ri{g) vanish unless J either 
contains {k, . . . , A; -|- — 1} or is disjoint from {A;, . . . , k + ik — I}- As a consequence. 



56 



TERENCE TAG AND TAMAR ZIEGLER 



each of the factors {v{9) j ^^'^ ) uje{o,i}''+*k-^ of the Taylor expasion of r]{g) are equal to 
rjlgp) for some generator gp of HK*^' ' '*''(G'), and the claim follows. 

From (IC.Sp . we see that P{ii, . . . , 4) is implies by P{ii, . . . , z^+ij = P{ii, . . . , ik-i, 1, . . . , 1), 
where ik copies of 1 appear in the latter expression. By symmetry and iteration, we 
conclude that P{ii, . . . ,ik) is implied by P(l, . . . , 1) where + . . . + ik copies of 1 appear 
in the latter expression, and Theorem IC.13I follows. □ 

Remark C.14. The group HK'^(G) itself comes with a natural filtration, with HK'^(G)j 
defined to be the Host-Kra group HK'^(G, {Gj+i)j^n) of G with the shifted filtration 
{Gj+i)j^^; see [211 Proposition B.15]. Theorem IC.13I can then be used to show that 
HK'^ can be viewed as a functor from the category of filtered groups to itself. These 
functors are related to each other by the pleasant identity HK-' o HK'^ = HK-''*''^ for all 
j, A; G N; in particular, one can define HK'^ recursively as an iteration of the functor 
HK^. We will however not adopt this perspective here. 

C.15. The additive case. Let V = {V, N) be an additive filtered group, and let k eN. 
By Proposition IC.12t HK^(V^) consists precisely of those tuples of the form 

\Jc{i,...,fc} jeJ 

with vj G V\j\ for all J C {1, . . . , A;}. Thus, for instance, HK^(V") is the space of all 
quadruples 

(^^00, ^^00 + ^^01,^^00 + ^^10, ^'OO + ^^01 + ^'lO + ^^ll) 

where foo e Vq, foi, fio e Vi, and Vn G V2. 

There is also an equivalent "dual" description of this space (which we will need to prove 
Proposition 19. 8p : 

Proposition C.16 (Description of HK'^(V^)). Let V = {V,Vf^) be an additive filtered 
group. Then for any k E N, HK^(V") consists precisely of those tuples {vui)uje{o,i}'' ^ 
V^^'^^' such that 

J2i-lpv^ G V. (C.4) 
whenever ^ i ^ k and F is a face of {0, 1} of dimension i. 

Thus, for instance, HK^(V") is the space of all quadruples (foo, "^oi, "^lo, "i^ii) such that 

VOO - Vol, Voo - Vio, Vol - Vii, f 10 - Vii G Vi 

voo - Vol - Vio + Vii G V2. 

Of course, this is equivalent to the previous description of HK^(y) after a change of 
variables. 
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Proof. Let V^''^ denote the space of all tuples (fa;)ijg{o,i}'= ^ V^'^'^^'' obeying the con- 
straints (]C.4p . This is clearly a subgroup of V^'^'^^'' . By checking the generators of 
HK^{V), we see that BK\V) C V^'^l Now we prove the reverse inclusion V^^^ C 
HK'^(y). This claim is obvious for A; = 0, so we may assume inductively that k ^ 1 and 
that the claim has already been proven for k — 1. 

Let V := {vuj)uje{o,i}'' ^ V^''^. We split v = v' + v", where 

and v" := V — v' . It is easy to see from flC.4p . that v' and v" lie in V^^\ The tuple 

lies in V^^~'^\ and hence in HK'^~^(\^) by induction hypothesis. Extending each generator 
of Y{K^~^{y) to BK^iy) by the homomorphism 

we then see that v' lies in HK'^(V^). In a similar spirit, we see from (lC.4p that the tuple 

lies in the analogue of in which the filtration {Vn)nm is replaced by the shifted 

filtration (V^+i)^^^- By induction hypothesis, this tuple thus hes in the analogue of 
HK'^~^(V"); extending each generator of this group to IIK^(V^) by the homomorphism 

(w^a;i,...,a;fc_i)a;G{0,l}'=-l ^ (^fc^oJi .....w^^i )a;G{0,l}'= ' 

we see that v" lies in HK'^(y). Thus v lies in IIK'^(\^), and the claim follows. □ 

C.17. Equidistribution. We isolate a special class of polynomial maps: 

Definition C.18 (Weak equidistribution on cubes). A polynomial map : if — t- G 
from one filtered group H = {H, H^) to another G = {G, Gn) is said to be weakly 
equidistributed on cubes if the maps HK^(0) : BK''{H) HK^(G) are surjective for 
every k eN. 

Informally, a polynomial map that is weakly equidistributed on cubes can attain every 
possible set of values on a cube in H, subject of course to the polynomiality requirement 
that this set of values must form a cube in G. 

The significance of weak equidistribution for us lies in the fact that they can be used to 
factorise polynomial maps: 

Lemma C.19 (Factorisation via weak equidistribution). Let G, H, K be filtered groups, 
let (j) : H G be a polynomial map that is weakly equidistributed on cubes, and let 
ip : G K be a map. Then ip is a polynomial map if and only if ip o ip is polynomial. 



Proof. This is immediate from Theorem IC.13[ 



□ 
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In practice, we will derive weak equidistribution from a stronger equidistribution prop- 
erty, which we formulate in the language of non-standard analysis: 

Definition C.20 (Equidistribution). Let A he a non-empty limit finite set, and let B 
be a finite set. A limit map f : A ^ B is said to be equidistributed if one has 

±\{aeA:f{a)=b}\ = j^^+o{l) 

for all b e B. 

A finite collection of functions fi : A — )■ Bi, i = 1, . . . ,k into finite sets Bi is said to 
be jointly equidistributed if the combined function {fi,...,fk) : A ^ Bi x . . . x Bk is 
jointly equidistributed. 

Definition C.21 (Equidistribution on cubes). A polynomial map (p : H ^ G from a 
limit-finite filtered group H = {H, H^) to a finite filtered group G = (G, G^) is said 
to be (strongly) equidistributed on cubes if the maps HK'^(0) : HK'^(if) — )■ IIK^(G) are 
equidistributed for every k E N. 



Observe that if A has unbounded cardinality, then every equidistributed limit map from 
A to a finite set B is automatically surjective; in praticular, strong equidistribution 
implies weak equdistribution. As a consequence of this and Lemma IC.19[ we obtain 

Corollary C.22 (Factorisation via strong equidistribution). Let G,H,K be filtered 
groups with G and Gq limit-finite with unbounded cardinality, and H finite. Let (f) : H ^ 
G be a polynomial map that is strongly equidistributed on cubes, and let ip : G ^ K be 
a map. Then ip is a polynomial map if and only if ip o cp is polynomial. 



For future reference, we observe a convenient criterion for equidistribution. 

Lemma C.23 (Weyl equidistribution criterion). Let A be a non-empty limit finite 
set, and let B be a finite abelian group. Then a limit map f : A ^ B is strongly 
equidistributed if and only if one has 

E,eAe(e(/(a))) = 0(1) (C.5) 

for all non-zero characters (i.e. homomorphisms) C, '■ B ^ T. 

Proof. By the Fourier inversion formula, the condition (IC.Sp is equivalent to the bound 

E,^AF{f{a)) = E,^BF{b) + o{l) 

holding for all standard functions F : B ^ C. But this is clearly equivalent in turn to 
the equidistribution of /. □ 



Appendix D. Properties of non-classical polynomials 



In this appendix we prove Lemma 11.71 The arguments here are analogous to those 
established in the context of dynamical systems in [3]. 
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It is convenient to introduce a ring of formal differential operators. 

Definition D.l (Differential operators). Let be a finite-dimensional or limit finite- 
dimensional vector space. A differential operator on is a formal combination (using 
addition and multiplication) of integers and additive derivatives A/^ (or equivalently, 
the shifts T/j), thus for instance 3 — 5A/j + 7AhAk + A^ is a differential operator. More 
generally, a formal differential operator is a finite or infinite series ^jfliA^. -^ . . . A^,. ^ 
where G Z, . . . , € V, and for each d there are at most finitely many i 
with di ^ d. Thus for instance A^ is a formal differential operator. Note that 
both differential operators and formal differential operators act linearly (over Z) on 
Poly^^(\^ — )■ G) for every d and G. We say that two formal differential operators on 
V are equivalent if they act the same on every space Poly^^(\^ — j- G). We let Diff(K) 
denote the space of formal differential operators on V modulo this equivalence relation; 
this is clearly a commutative ring (note that A/^A^ = Af^A^ for all h, k). 

Example D.2. li h,k E V, then we have the cocycle equation 

Ah+k = A^ + nAk (D.l) 

in Diff(l^), since we have 

Ah+kfix) = Ahfix) + Akf{x + h) 

for all / : y — )■ G and x,h,k E V. This cocycle equation can also be deduced from the 
group law 

Th+k = ThTk 

and the identity 

AH = n- 1. 

The reason for working with formal differential operators rather than genuine differential 
operators is that any formal differential operator of the form 1 + D, where D consists 
of higher order terms in the sense that it contains no constant term in its expansion, is 
invertible in Diff(V") by formal Neumann series: 

(l + D)-^ = 1-D + D^-.... 

To illustrate this, take h E V. Since ph = 0, we clearly have 

U = T,, = 1. 

Expanding Th = I + Ah and using the binomial formula, we conclude after some re- 
arrangement that 

Al = -pAhil + ^Ah + ... + Al'). 

The expression in parentheses can be inverted by formal Neumann series. We conclude 
the following fundamental fact: 

Lemma D.3 (Multiplication by p). For any h E V , we have pAh = A^ x 1^ for some 
invertible Ih G Diff(l^). Furthermore, Ih is equal to —1 plus higher order terms. 

A heuristic way to interpret this lemma is that the operation of multiplication by p 
resembles a differential operator of order p — 1; dually, ^ resembles a polynomial of 
degree p — 1. (This may help explain the condition ii + . . . + in + j {p — I) ^ d in (11. 2p .) 
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We now begin the proof of Lemma ll.7[ 

The first part of claim (i) is clear by induction on d. To prove the second part, we 
observe from the cocycle identity fID.ip that if AhP, A^P both lie in Poly^^_i(F — )■ G), 
then Ah+kP does also, and so the second part of (i) follows from the first. (One can 
also deduce (i) from Proposition IC. 101 and induction.) 

Next, we establish (ii). We begin with the one-dimensional case n = 1. For ^ d ^ P~^, 
the vector space Poly^^(K — )■ F) clearly contains the vector space spanned by the 
monomials 1, x, . . . , x'^, which are linearly independent as can be seen from computing a 
Vandermonde determinant (or using the Newton interpolation formula). On the other 
hand, the differential operator Ai : Poly^^(V^ — F) — )■ Po\j^^_i{V — > F) has kernel 
equal to the constant functions F, and so the dimension of Poly^^(y — i- F) can only 
exceed that of Poly^^_]^ by 1 at most. By induction we thus see that Poly^^(V^ — )■ F) 
is equal to the d + 1-dimensional space spanned by 1, x, . . . , x'^ for ^ d ^ p — 1. In 
particular. Poly ^p_i(y — )■ F) must be equal to the ]?- dimensional space of all functions 
from V to F, and the claim follows. 

Now we assume inductively that n > 1 and that (ii) has already been proven for smaller 
dimensions. We parameterise an element x G F" as x = (x', Xn) where x' G F'^"^ and 
x„ G F. If P G Poly^^(l^ — 7- F), then clearly the one-dimensional maps x„ i— ?■ P(x',x„) 
are polyomials of degree ^ d for each fixed x'. Applying the one-dimensional case of 
(ii), we conclude that 

P(X',X„)= P^r^i^'K" 

for some functions Pj„ : F""^ — )■ F that are uniquely determined by P. Differentiating 
this identity in times in the direction of the n^^ generator e„ of F", and d — in times in 
directions in F"~^, we conclude that each Pj^ is a polynomial of degree ^ d — in- The 
claim then follows from the induction hypothesis. 

We skip (iii) for the moment and move on to (iv). If h E V and P, Q : V ^ R, we have 

T,(pg) = (r,p)(T,g); 

expanding T/j = 1 + A/j, we conclude the discrete Leibniz rule 

A,(pg) = (A,p)g + p(A,g) + (A,p)(A;,g). (d.2) 

The claim (iv) can now be easily established by an induction on d + d' (noting that 
the claim is trivial if d or d' is negative). (Alternatively, one can deduce (iv) from [2U 
Example B.18].) 



We remark that one should view the final term {AhP){AhQ) in (ID.2P to be a lower 



order error term, so that (ID.2P becomes a perturbation of the classical Leibniz rule 



D{PQ) = {DP)Q + P{DQ) for derivations D. 

Now we prove part of (v). If P G Poly^^(V^ — )■ G) for some d ^ p — 1, then 

A,,...A,,_^^,P = 
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for any hi, ... , hd-p+2 G V, and thus by Lemma [D. 31 

We conclude that pP G Poly^^„p^]^(V" — t- G). We conclude that for any integer d, the 
map p : P ^ pP maps Poly^^(l^ G) to Poly^jna^(rf_p+i_o)(^ ^ G*)- This proves 
everything in (v) except for the assertion that this map P h- )■ pP is surjective. 

Now we return to (iii) . The fact that every expression P of the form (11.21) is a polynomial 
of degree ^ d, and vice versa follows from the special case of Proposition 110.41 when all 
the initial degrees Di are equal to 1. (This argument is non-circular, because Lemma 
ll.7r iii) is not used in the proof of Proposition 110.41 Another proof of this part of Lemma 
OTiii) can be found in [til §1.12]. 

Now we establish the uniqueness claim in (iii). The claim is trivial for d = 0, so suppose 
inductively that d ^ 1 and that uniqueness has already been established for smaller 
values of d. Since a = P{0) from (II. 2p we see that the a are unique and can thus be 
subtracted away. Applying the uniqueness claim to the lower-degree polynomial pP, we 
see that all the coefficients in (II. 2p with j ^ 1 are unique. Subtracting off these terms 
also, we are left with a classical polynomial expansion (II. ip . and the claim follows from 
(ii). This concludes the proof of (iii). 

Now that we have (iii), the surjectivity claim of (v) is immediate, since one simply 
replaces all the p'^^ denominators in (II. 2p by p'^'^, and replaces a with a p^^ root a' as 
in the proof of (iii). This completes the proof of (v). 

The claim (vi) follows immediately from (v) and an induction on d. 

Appendix E. On exact roots in dynamical systems 

In [3], [13], the inverse conjecture for the Gowers norms were attacked via an ergodic 
theory approach, based on a structual analysis of -systems. These systems consisted 
of a probability space (X, /x), together with a measure-preserving action (Tg)ggF" of 
the infinite vector space F'^ := IJ^i (where we nest F" inside F""*"^ in the obvious 
manner) . 

A polynomial of degree ^ d on such a system is a measurable function P : X — )■ T such 
that A;,, . . . A;,,^^P = a.e. for all /ii, . . . , h^+i e F-, where AJ := / o T^, - 1. The 
analogue of the exact roots property from Remark 11.91 is then 

Claim E.l. Let X = {X, B, fi, {Tg)g^Y'^) be a -system, let d ^ be an integer, and let 
P : X T be a polynomial of degree ^ d. Then there exists a polynomial Q : X ^ T 
of degree ^ d -\- p — 1 with pQ = P. 

Using Lemma fl.7r v). it is not difficult to verify this claim when X is finite; the claim 
is also easy when = 1, as one can then (up to a constant) express P = l{P) for some 
linear polynomial P : X — F, and one can verify that the polynomial Q := e(|P|/p^) will 
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have the desired properties (note from Lemma [1.7( iii) that the map x i— )■ mod 1 

has degree ^ p on F). Unfortunately, the claim fails in general. For instance, we have 

Proposition E.2. Claim \Kl\ is false when p = 2 and d = 2. 

The purpose of this appendix is to prove this proposition, which explains why we were 
unable to use the ergodic theory method from [13], [5] to establish the main results in 
this paper. 

The reason that the exact roots property holds in the finitary setting but not in the 
ergodic setting can be explained as follows. In the finite setting Fg, all functions are 
considered to be measurable; but in the ergodic setting, only a limited number of func- 
tions are measurable. For instance, one may be working in a system generated by a 
single function / and its shifts T^f, so that every measurable function in the system can 
be approximated to arbitrary accuracy by a finite combination of shifts T^f of these 
functions. 

Now consider the finitary quadratic function P := ^ mod 1 on Fj , which is the analogue 
of the function L4^{t) considered above. This function has a degree ^ 3 root, namely 
the function Q '■= ^ mod 1. However, this polynomial Q is not "measurable" in the 
system generated by P, in the sense that one cannot express (even approxmiately) Q 
as a function of a bounded number of shifts AhP of P; indeed, one can formulate this 
precisely and then deduce this from the arguments in the proof of Proposition IE. 21 given 
below, combined with the Furstenberg correspondence principle, but we will not do this 
here. 

On the other hand, if we allow ourselves the freedom to extend the system X to a larger 
one, then it appears that one can recover the exact roots property. For instance, in 
the system X = F^ Xp Z/4Z defined below, we may extend this system to the system 
Y := F'^ Xpg where the cocycle ps is defined exactly as with p — p4 but using the 

modulus 8 rather than 4. Letting tg be the vertical coordinate function of Y, the degree 
^ 2 polynomial L4{t) in X then lifts to 2^8(^8) in where ls : Z/8Z — )• {0, 1 . . . , 7} 
is the obvious map. This has an obvious root that is of degree ^ 3, namely Lsits)', 
this function is the ergodic analogue of the finitary function Q = ^ mod 1 considered 
earlier. It is likely that this phenomenon generalises, in that Claim IE. II becomes true 
again if we allow Q to take values in an extension of P, but we will not pursue this 
statement here. (Note though that one can use extensions to simplify the proofs of 
various multiple recurrence and convergence results in ergodic theory; see for instance 



In principle, this weakened form of Claim lE.H in which the root takes values in an 
extension, may possibly be used to extend the ergodic theory arguments in [3] to the 
low characteristic setting, and in particular to recover the ergodic version of the inverse 
conjecture for the Gowers norms for F'^ (see [3], [13]) in that setting. However, we were 
not able to achieve this, as the inductive arguments in [3] rely on reducing the system 
and are thus not compatible with taking extensions. 
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We now begin the formal proof of Proposition IE.2[ Let F := F2. Consider the Cantor 
space = HneN^^ (with the product cx-algebra and the uniform (Bernoulh) proba- 
bihty measure), which has an obvious action of F'^. If we let a;j : F^ — )■ F, -j G N be the 
coordinate functions, we thus have 

for i,j G N, where ei, 62, ■ ■ ■ are the generators of F*^ and 6ij is the Kronecker delta. In 
particular, the functions L{xj) : X ^ T are polynomials of degree ^ 1. 

We define a cocycle p = p4 : F"^ x F^ — )■ Z/4Z taking values in the cyclic group Z/4Z 
by the formula 

p(^ei,x) := ^(1-21x^1) mod 4 (E.l) 

ieA ieA 

for any finite set A. One easily verifies the cocycle equation 

p{h + fc, x) = p(/i, x) + p{k, Thx) 

for any h^k G F'^. We can therefore build the cocycle extension X := F^ Xp Z/4Z of 
F^, defined as the space of pairs {x,t) with x G F^ and t G Z/4Z (with the product 
probability measure) endowed with the shift 

Th{x,t) := {ThX,t + p{h,x)). 

This can be easily verified to be a F'^-system. If we let ^.4 : Z/4Z — )■ T be the map 
L4:{i) := i/4:, and t : {x,t) ^ t be the coordinate function, then the function 14(1) has 
derivatives 

AhLi{t) = Liipih, ■)), 

and thus by (lE.ip . 64 (t) is a polynomial of degree ^ 2. Similarly, 2^4 (t) is a polynomial 
of degree ^ 1. 

If Claim IE. II was true, then there would exist a polynomial Q : X ^ T of degree ^ 3 
such that 2Q = 14(1) . To show that this is not possible, we need to classify all the 
polynomials of degree ^ 3. 

We begin with the polynomials of degree ^ 0, which (up to measure zero errors) are 
simply the F'^-invariant functions P : X — T. We claim that X is ergodic, so that 
the only invariant functions are the constants (up to measure zero errors). It suffices 
to show that every invariant set E in X has zero measure or full measure. Given any 
e, we can approximate E to an error of measure e by a set E^ which depends on only 
a finite number xi, . . . ,x„ of the base coordinate functions, together with the vertical 
coordinate t. The set E^ is then invariant up to errors of measure 0{e). Inspecting the 
action of the shift Te^^^ on E^, one then easily concludes that E^ must differ by an error 
of 0{e) from a set which does not depend on the vertical coordinate. Taking e — )■ 0, we 
conclude that E is (up to measure zero errors) independent of the vertical coordinate, 
and thus descends to an invariant subset of F^. But it is standard from the theory of 
Bernoulli systems that such sets have either zero measure or full measure. 

Now we classify the polynomials of degree ^1. If P has degree ^ 1, then for each Cj, 
Ae-P is degree ^ and hence constant. Since A2eiP is necessarily trivial, we conclude 
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from the cocycle equation A/^+fc = Ah + ThA^ with h = k = Ci that 2Af..P = 0. Thus we 
can find coefficients Cj G F such that Af.-P = Ci for all i. Also, since P is measurable, it 
differs by an error of e (in measure) from a function which depends on only ffnitely many 
of the xi, . . . ,Xn and t. Note that ii rii > n2 > n and {x,t) are such that x^ 7^ Xn2, 
then the shift Te„^_e„2 does not affect the xi, . . . , Xn, t coefficients. On the other hand, 
^e„^-e„2-P = P + Cni — c^j- and thus (if e is small enough) we have = c independent 
of rii for rii > n. We now see that P differs from the degree ^ 1 polynomial 

n 

^(q - c)i(Xi) + 2t4(t) 

1=1 

by a degree ^ polynomial, which is thus constant. Thus all degree ^ 1 polynomials 
take the form 

P = CiL{xi) + 2dLi{t) + a 

i 

where a G T, G Z, and at most finitely many of the Cj G Z are non-zero. 

To classify polynomials of higher degree we employ the method of vertical differentiation, 
which is used extensively in the ergodic theory literature (see for instance [26], |16], [3]); 
for this simple example we use a very concrete instance of this method here. We define 
the vertical derivatives A^P of a function P : X for any s G 'L/4:'L by the formula 

AsP{x, t) := P{x, t + s)- P(x, t). 

Observe that these operators commute with themselves and with the F*^ action, and in 
particular commute with the ordinary derivatives Ah. 

The key observation is that A2 behaves like a differential operator of order two: 

Lemma E.3. Let s = 1,2. //P : X — t- T has degree ^ d for some integer d, then A^P 
has degree ^ d — s. 

Proof. By repeated differentiation it suffices to verify this when d = s — 1. But this 
follows from the classification of polynomials of degree ^ and degree ^ 1 that has 
already been established. □ 

We can now classify polynomials of degree ^ 2 (cf. Lemma [1.7( iii)): 

Lemma E.4 (Classification of quadratics). Let P : X ^ T be of degree ^ 2. Then we 
can write 

P = + E^^X + + rf^ + + a mod 1 (E.2) 

i<j i i 

where Cij,Ci,di,d,e are integers, of which only finitely many are non-zero, a G T, and 
Si : X ^ ¥ is the function Si := t mod 2, and x 1— |x| for x G Z/4Z is the obvious 
map to the fundamental domain {0, 1, 2, 3}. 

Proof. By a computation, one verifies that all the expressions on the right-hand side of 
flE.2p are indeed polynomials of degree ^ 2. 
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From Lemma [E.3[ A2P is constant. Using the cocycle identity 

A2P(x,t) + A2P(x,t + 2) = (E.3) 

we see that this constant is either or 1/2. In the latter case, we can subtract off |t|/4 
to reduce to the former case (noting that A2|t|/4 = 1/2); so we may assume without loss 
of generality that A2P = 0, thus P descends to the reduced system F'^ Xp mod 2 Zi/2Z. 

From Lemma [E.3I again. AiP is linear, and thus takes the form 

i 

for some integers di (of which only finitely many are non-zero) and /3 G T. With the 
cocycle identity 

AiP(x, t) + AiP(x, t+l)= A2P{x, t) = (E.4) 
we see that /3 is either or 1/2, thus /3 = d/2 for some integer d. If we then subtract off 
di l^'l^'^^l + d^-^ from P (noting that Ai^i = 1) we can reduce to the case AiP = 0, 
thus P now descends to a function of F^. 

For any e, we may approximate P in measure to error £ by a function depending only 
on finitely many xi, . . . ,Xn of the coefficients. For any n' > n, the linear polynomial 
Ae^,P is then within 0{e) of zero in measure, and is thus constant (by the classification 
of linear polynomials); using the cocycle identity = ^e^,P + T^^,/\f.„iP we see that 
Ae^,P is in fact identically zero, thus P in fact descends to a function of just a finite 
number of coordinates xi, . . . , Xn- The claim now follows from Lemma [1.7r ii). □ 

In a similar vein, we can classify cubics: 

Lemma E.5 (Classification of cubics). Let P : X he of degree ^ 3. Then P is an 
integer linear combination of a finite number of the following functions: 

(i) Constants a G T; 

^11^ I OC '1 1 1 X j 1 1 X 1/2 for natural numbers i < j < k; 

(iii) |a;j||a;j|/4 for natural numbers i < j; 

(iv) |xj|/8 for a natural number i; 

1 1 5*1 1/2 for natural numbers i < j; 

(vi) |a;j||S'i|/4 for a natural number i; 

(vii) liSil/S for a natural number i; 

(viii) |xj||S'2|/2 for a natural number i; 

(ix) 1 5i 1 1^2 1/2; 

(x) |52|/2. 

Here S2 '■ X ^ ¥ is the function such that 5*2 (x,t) = 1 when t = 2,3 mod 4 and 
5*2 (x,t) = otherwise. 

Remark E.G. The polynomials 5*1, 5*2 can be viewed as the ergodic limit as n — )■ 00 
(using the Furstenberg correspondence principle) of the symmetric polynomials 5*1, 5*2 
considered in Example II. 16^ where the coordinate functions Xi correspond to the usual 
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coordinate functions on , and t corresponds to L mod 4. The cubic polynomial L4^{t) = 
\t\/4 can be expressed as |t|/4 = \S2\/2 + |5i|/4; cf. ffT3D . 



Proof. By a (somewhat tedious) computation we see that all the above functions are 
polynomials of degree ^ 3 (and that 5*2 is a polynomial of degree ^ 2). 



From Lemma [E.3| A2P is of degree ^ 1, so we can express 

A2P = ^Q|xi|/2 + c|^i|/2 + a 

i 

for some integers Cj, c (only finitely many of which are non-zero) and a G T. Using flE.Sp 
we can write a = d/2 for some integer d. By subtracting J2i Ci|^j| I 1/2 + c| 5*1 1 15*21/2 + 
d\S2\/2 from P we may thus assume that A2P = 0. 

Next, AiP is of degree ^ 2, and is annihilated by A2, so by Lemma [E.4t we have 
AiP = + ^ + ^ c?*^^^ + d^-^ + a mod 1 

i<j i i 

for some integ di, d (only finitely many of which are non-zero) and a G T. 

Using flE.4p . we conclude that the must vanish, and that ^ + 2a = 0; thus we can 
simplify the above expression to 

A,p = + - + - ^) + 1 ^ 

i<j i 

for some integer e. If we then subtract off the cubic polynomial 

E Xj 5*1 7 P'i "^1 , , 
Cii h > di h d 
^J 2 ^ 4 8 2 

Kj t 

from P, we can reduce to the case AiP = 0, thus P descends to F^. Arguing as in 
Lemma \EA\ we conclude that P is a function of finitely many coordinates xi, . . . ,Xn, 
and the claim follows from Lemma [1 .7( 111). □ 



From the above lemma, we see that if P is a poljTiomial of degree ^ 3, then 2P is an 
integer linear combination of the following types of functions: 

(i) Constants a G T; 

(ii) |xj||xj|/2 for natural numbers i < j] 
(ill) |xj|/4 for a natural number i; 

(iv) I 1 1 5*1 1/2 for a natural number i; 

(v) 1 5*1 1/4 for a natural number i. 

In particular, A2(2P) must vanish. On the other hand, A2L4,{t) = ^ 7^ 0. Thus 14(1) is 
not of the form 2P for a polynomial of degree ^ 3, thus establishing Proposition IE.2I 
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Remark E.7. One can avoid the full classification of polynomials of degree ^ 3 in 
proving the above proposition. Indeed, if P is cubic, then from Lemma TE-SI AoAqP = 0, 
and then by (IE.3P A2(2P) = 0, and we can conclude as above. 
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